Lead the design and implementation of automation, monitoring, and incident response processes while mentoring engineers and ensuring system reliability for cloud-native applications.
Lirio is a technology/software company that provides expertise in a variety of behavioral science domains (e.g., behavioral economics, social psychology, public health), data science, and machine learning to drive consumer engagement, close gaps in preventive and chronic care, and promote health and well-being across an individual’s lifespan. Lirio’s behavior change AI platform unites behavioral science with advanced artificial intelligence (AI) to deliver Precision Nudging health interventions. Precision Nudging is the application of behavioral science to health interventions personalized by AI to each individual that overcome barriers to action at the right time and place for scalable, behavior change.
This is a remote role with the opportunity to be hybrid if located in Tennessee. All applicants must be authorized to work in the US without sponsorship.
To ensure an excellent onboarding experience and integration into the company, new colleagues will spend their first week onsite at one of our offices in Tennessee. Travel expenses will be paid. This is a requirement.
The Senior System Reliability Engineer (SRE) at Lirio is responsible for the reliability, scalability, and performance of our cloud-native applications and infrastructure. This role leads the design and implementation of automation, monitoring, and incident response processes, and mentors other engineers in SRE best practices. The Senior SRE partners with development teams to ensure robust, secure, and highly available systems, and drives continuous improvement in operational excellence.
This role operates as a senior, hands-on reliability engineer embedded with product and platform teams. The Senior SRE is accountable for defining and enforcing service-level objectives (SLOs), reducing operational toil through automation, and improving system reliability through proactive engineering rather than reactive support. This role is not ticket-driven operations and is expected to influence architecture, development practices, and incident readiness across the platform
Reliability Engineering & Automation (40%)
Peer Reviews & Collaboration (10%)
Operational Support & Incident Management (20%)
Mentorship & Knowledge Sharing (10%)
Continuous Learning & Innovation (10%)
Documentation & Process Improvement (5%)
Flexible Work Hours
Flexible time off policy
Health Insurance
Vision
Paid holidays and company closures
10 paid holidays, quarterly company closure dates, + holiday week company closure
Paid Parental Leave
6 weeks paid parental leave
Remote-Friendly
Work from home
Lirio builds a behavior change AI platform that integrates behavioral science and artificial intelligence to improve health outcomes through tailored communications. By offering expertise in behavioral science, data science, and machine learning, Lirio empowers healthcare organizations to enhance consumer engagement and address gaps in care.
Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!
Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.
Reliability Engineer Q&A's