DevOps Engineering Manager, SRE and Cloud Service

AI overview

Lead DevOps and SRE teams to ensure reliability and performance of cloud platforms while fostering a culture of continuous improvement and collaboration across global teams.

Role Overview

The DevOps Engineering Manager, SRE and Cloud Services is responsible for leading the teams that ensure the reliability, scalability, and performance of IntegriChain’s cloud platforms and production systems. This role manages DevOps and Site Reliability Engineering functions, with a strong focus on cloud infrastructure, automation, and operational excellence.

You will work closely with application engineering, platform, security, and IT teams to support product delivery while maintaining high standards for availability, resilience, and security. This role balances people leadership with hands-on technical engagement and is critical to the success of our SaaS platforms in a healthcare and life sciences environment.

How a Day in This Role Looks

Your day typically starts with connecting to the team through daily standups or operational check-ins. You review system health, active work, incidents, and priorities, making sure the team is focused on what matters most and that risks are addressed early. You stay close to production systems through dashboards, alerts, and direct conversations with engineers.

Throughout the day, you work directly with DevOps, SRE, and application engineering teams to remove roadblocks and keep work moving forward. This may involve helping troubleshoot issues, guiding technical decisions, or coordinating across teams to resolve dependencies. You are regularly involved in design and architecture discussions, helping teams think through reliability, scalability, performance, and operational readiness.

Because the team operates across multiple time zones, you spend time coordinating work and maintaining clear communication across regions. You help establish shared processes, clear handoffs, and consistent expectations so work continues smoothly around the clock.

When incidents or operational challenges arise, you support response efforts, help coordinate resolution, and ensure follow-up actions are completed. Over time, you help turn recurring issues into lasting improvements by strengthening automation, cloud practices, and reliability standards.

Key Responsibilities

DevOps and SRE Leadership

  • Lead and develop a team of DevOps and SRE engineers supporting cloud infrastructure and production systems.
  • Set clear priorities, goals, and expectations for reliability, performance, and operational readiness.
  • Foster a culture of ownership, continuous improvement, and learning across the team.

Cloud and Platform Operations

  • Oversee cloud infrastructure across environments, ensuring scalability, resilience, and cost efficiency.
  • Drive adoption of infrastructure as code, automation, and standardized tooling.
  • Partner with engineering teams to support platform needs and production deployments.

Reliability and Operational Excellence

  • Establish and improve SRE practices, including monitoring, alerting, incident response, and post-incident reviews.
  • Lead efforts to reduce operational toil and improve system reliability through automation and process improvements.
  • Support release management, change control, and operational governance.

Architecture and Engineering Collaboration

  • Participate in design and architecture discussions to ensure systems are built for reliability, scalability, and operability.
  • Review and guide implementation approaches related to CI/CD, cloud services, and platform architecture.
  • Advocate for operational best practices early in the development lifecycle.

Cross-Functional Collaboration

  • Work closely with Product, Engineering, Security, and IT teams to align operational priorities with business needs.
  • Communicate clearly with stakeholders on system health, risks, and improvement initiatives.
  • Support compliance and security requirements relevant to healthcare and life sciences technology platforms.
  • 7 or more years of experience in DevOps, SRE, or cloud engineering roles.
  • 3 or more years of experience leading or managing technical teams.
  • Strong hands-on experience with cloud platforms such as AWS, Azure, or GCP.
  • Experience with CI/CD pipelines, infrastructure as code, monitoring, and incident management.
  • Solid understanding of reliability, scalability, and operational best practices.
  • Strong communication and collaboration skills.
  • Experience supporting SaaS platforms in regulated or compliance-driven environments.

Preferred

  • Familiarity with SRE concepts such as SLIs, SLOs, and error budgets.
  • Experience working with globally distributed teams.
  • Background in healthcare or life sciences technology.

What does IntegriChain have to offer?

  • Mission driven: Work with the purpose of helping to improve patients' lives! 
  • Excellent and affordable medical benefits + non-medical perks including Flexible Paid Time Off and much more!
  • Robust Learning & Development opportunities including over 700+ development courses free to all employees

#LI-ZG1

IntegriChain is committed to equal treatment and opportunity in all aspects of recruitment, selection, and employment without regard to race, color, religion, national origin, ethnicity, age, sex, marital status, physical or mental disability, gender identity, sexual orientation, veteran or military status, or any other category protected under the law. IntegriChain is an equal opportunity employer; committed to creating a community of inclusion, and an environment free from discrimination, harassment, and retaliation.

Our policy on visa sponsorship for US based positions: Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by IntegriChain.

Perks & Benefits Extracted with AI

  • Flexible Work Hours: Excellent and affordable medical benefits + non-medical perks including Flexible Paid Time Off and much more!
  • Learning Budget: Robust Learning & Development opportunities including over 700+ development courses free to all employees

Careers at IntegriChain. Find Great Talent with Career Pages. | powered by SmartRecruiters | Find Great Talent with a Career Page.

View all jobs
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

DevOps Engineering Manager Q&A's
Report this job
Apply for this job