Senior DevOps Engineer with SRE Expertise - EMEA

Rescue.co is a rapidly growing tech startup revolutionizing emergency response. Our 24/7 Dispatch Center serves millions of Kenyans, while our platform powers Emergency Medical Service apps.

Our vision? Emergency help is minutes away, anytime, anywhere. In the past five years, we've built a lifesaving platform and the largest ambulance and responder network in East Africa. Our platform has slashed response times by over 80%, saving countless lives.

Now, we're expanding and seeking dedicated team members to steer this next phase of growth. If you're passionate, hardworking, and ready to make a difference, join us! We're a diverse team of over 52 specialists in business, operations, software engineering, and emergency dispatch, based in Nairobi but reaching across Kenya and beyond. Focused on achieving goals, we're always refining our tech and operations, guided by values like agility, grit, and innovation.

If you're passionate about making a difference during crises and can stay calm under pressure, we want you. We value teamwork, multitasking, attention to detail, and a love for new technology. Ready to bring your best to work every day, grow your career, and make a real impact? Come join us.

Our Values:

  • Consistency: We maintain high standards in everything we do.
  • Design with EQ: We create with empathy, focusing on human needs.
  • Always Questioning: We encourage curiosity and continuous improvement.
  • Alignment: We work together towards common goals.

About the role:

We are seeking a skilled professional to design and manage deployment workflows, automate CI/CD pipelines, and enhance internal development platforms for seamless testing and cloud deployments. The role involves implementing robust monitoring and observability solutions to ensure system reliability, diagnosing and resolving production issues swiftly, and collaborating with teams to optimize performance and uptime. Ideal candidates will excel in streamlining processes, proactive problem-solving, and driving efficiency across multi-regional cloud environments.

This is what you will do: 

Deployment Workflows

  • Design and build deployment workflows using GitHub Actions, ensuring scalability and efficiency.
  • Automate and optimize CI/CD pipelines for multi-regional cloud deployments.
  • Internal Dev Platform - Develop robust workflows to support local testing and independent cloud-based feature branch testing environments.

Monitoring & Observability

  • Implement comprehensive monitoring, logging, and alerting systems using tools like Prometheus, Grafana, OpenSearch, OpenTelemetry, or AWS CloudWatch.
  • Ensure rapid detection of anomalies and proactive performance optimization.

Incident Management & Quick Fixes

  • Diagnose and resolve production issues quickly to maintain 99.999% uptime.
  • Collaborate with engineering teams to provide initial assessments and possible workarounds for production issues or vulnerabilities.
  • Discuss and plan integration tasks to enhance overall system performance.

Requirements

Technical Skills

  • Proven experience building and managing deployment workflows with GitHub Actions.
  • Expertise in containerization and orchestration technologies such as Docker, Kubernetes, and Helm.
  • Strong hands-on knowledge of AWS (container services, S3, Lambda, RDS, VPC, EventBridge) and cloud-native tools.
  • Proficiency in scripting and automation using Bash, Python, or Node.js.
  • Strong experience with observability tools (e.g., Prometheus, Grafana, Sentry, CloudWatch, Elasticsearch, OpenTelemetry) to ensure optimal system performance and uptime.
  • Experience writing supporting code for internal scalability, security, and reusability.
  • Experience writing production-grade code in TypeScript and Node.js.
  • Knowledge of databases (PostgreSQL, MongoDB).

DevOps Expertise

  • Strong background in building efficient CI/CD pipelines and automating deployment workflows.
  • Familiarity with test automation frameworks for local and cloud-based environments.

SRE Practice

  • Experience in incident management, including root cause analysis and implementing proactive fixes.
  • Ability to define and monitor SLOs, SLAs, and error budgets.

Soft Skills

  • Collaborative mindset and excellent communication and leadership abilities to coordinate with a diverse technical team.
  • Strong problem-solving skills.
  • Ability to work in an agile development environment.
  • Commitment to delivering reliable, efficient, high-quality work.

Bonus points for:

  • AWS Certifications (Professional level preferred).

Benefits

To help you bring your whole self to work, here is what we offer when you join us at Rescue:

  • A chance to make an impact in a mission-driven organization.
  • Fully remote role with the flexibility to work from anywhere.
  • Unlimited paid time off.

How to Apply

This role is fully remote and offered on a full-time contract basis. The role is open to candidates residing within the EMEA time zones, spanning GMT to GMT+3. Join us in our mission to serve and uplift our community. Apply now, and let's make a positive impact together! 

P.S. While we're eager to learn about your experience, please submit your impressive journey in a one-page PDF CV. This helps us get started more quickly.

At Rescue.co, we believe in the power of authenticity. We value you for who you are, regardless of your gender, age, ethnicity, race, sexual orientation, religion, veteran/military status, disability, or any other characteristic protected by local laws and regulations. Bring your true self to work!

Flare is a real-time platform that brings together the fragmented ecosystem of emergency responders in emerging markets. We use real-time data to coordinate emergency response and save lives. Described by Fast Company as "the 911 of the future," Flare uses modern technologies to save lives and make peace of mind accessible to billions of people living without access to emergency support. Learn more in the Fast Company article: https://www.fastcompany.com/company/flare
