Site Reliability Engineer (SRE)
We are looking for an SRE to strengthen our team. You will be responsible for designing, deploying, and maintaining reliable, scalable, and secure infrastructure on Microsoft Azure, leveraging tools like Digital Twins, EventHubs, and ADX clusters.
The role involves automating deployments with Bicep and GitHub Actions, optimizing CI/CD pipelines, and ensuring system observability through Azure Monitor and Log Analytics. You will drive performance optimization, enforce security best practices, and refine and test our disaster recovery strategies. As a key collaborator with engineering teams, you’ll integrate DevOps practices, mentor team members, and ensure the smooth operation of mission-critical services, fostering a culture of reliability and efficiency.
Design, implement, and manage highly reliable and scalable systems on Microsoft Azure, with a focus on Azure Digital Twins, Event Hubs, and ADX clusters.
Continuously enhance the performance, availability, and reliability of our cloud-based platforms.
Develop and maintain Infrastructure as Code (IaC) solutions using Bicep to ensure consistent, repeatable, and automated deployments.
Build and optimize CI/CD pipelines using GitHub Actions, enabling seamless integration and delivery processes.
Establish and monitor key performance indicators (KPIs) and service level objectives (SLOs) to measure and maintain system health.
Implement robust observability tools for logging, monitoring, and alerting to ensure rapid issue detection and resolution.
Proactively identify areas for performance improvement, scalability enhancements, and cost optimization across infrastructure and applications.
Collaborate with cross-functional teams to embed reliability and scalability best practices into development and operational workflows.
Conduct thorough post-incident reviews to identify and mitigate root causes, ensuring continuous improvement.
We're looking for the following qualifications:
5+ years of experience working as an SRE, DevOps Engineer, or similar role focused on cloud-native architectures.
Strong expertise in Microsoft Azure, particularly in Azure Digital Twins, Event Hubs, and ADX clusters.
Proven experience building and managing reliable and scalable systems in cloud environments.
Advanced proficiency in Infrastructure as Code (IaC) using Bicep.
Experience developing and maintaining CI/CD pipelines using GitHub Actions.
Solid understanding of performance optimization techniques for cloud-based systems.
Familiarity with SLOs, SLIs, and KPIs to track and ensure system reliability.
Hands-on experience with monitoring and alerting tools (e.g., Azure Monitor, Prometheus, Grafana).
Strong programming or scripting skills in PowerShell, Bash, or similar languages.
What do we offer you, on top of a suitable and competitive salary?
A one-year contract, with the intention to extend (as this is a permanent position).
Holiday pay (8% off the gross salary).
27 vacation days per calendar year.
Remote work: opportunity to work abroad up to 10 days per calendar year.
Wellbeing: we very much value our employees’ wellbeing. Besides the weekly bootcamp sessions we also offer delicious coffee and tea, healthy lunches, and snacks at the office.
Company laptop and a monthly allowance of EUR 25 to cover phone costs.
Development: an individual training budget up to EUR 1000 yearly and 5 paid 'personal growth' days a year to spend how you see fit.
Travel allowance (based on actual costs / kms).
Please note that this offer is based on full-time (40 hours) employment.
We are a dynamic and innovative team of 8 engineers from diverse backgrounds, coming together from all around the world. Our team thrives on collaboration, mutual respect, and a deep appreciation of each other's strengths, forming a closely bonded group both in and out of the office.
We take pride in being open-minded and enjoy challenging each other to identify the best technologies and solutions for our products. Sharing knowledge and helping each other grow is at the heart of what we do. Our bi-weekly sessions are dedicated to exchanging ideas, sparking inspiration, and fostering cross-disciplinary collaboration among our backend/frontend developers, data engineers, and building system modeling experts.
Of course, we’re also tech enthusiasts who love to make jokes, talk about the latest tech-trends, and celebrate our shared passion for hot sauces. If you’re looking for a collaborative, fun-loving, and forward-thinking team, you’ll find your place with us.
Our Talent Acquisition team and hiring manager will review your application, and aim to respond within 2 weeks.
If you seem like a good fit, we’ll invite you to a screening call so we can learn more about each other.
When we both feel like moving forward, we would like to invite you for a Technical Interview to assess your skillset. You will meet the team and in the final round our Chief Product Officer.
If you are passionate about developing sustainable real estate solutions, and possess the necessary skills and experience, we invite you to join our team at Next Sense.
Do you want to join our team as our new Site Reliability Engineer (SRE)? Then we'd love to hear about you!