Team Lead, Site Reliability Engineering

Lagos , Nigeria
On-site

AI overview

Guide a squad of engineers to ensure reliability in a hyper-growth financial platform, focusing on architecture, mentorship, and strategic direction.

Who we are

Moniepoint is an all-in-one financial services platform for emerging markets and the second-fastest growing company in Africa.
Since 2019, Moniepoint’s technology has powered over 3 million people, offering personal and business banking, payment, credit and business management tools to help them succeed. Moniepoint processed $182 billion in 2023, and currently processes the majority of the POS transactions in Nigeria. 

Curious about what makes Moniepoint an incredible place to work? Check out posts on how we cultivate a culture of innovation, teamwork, and growth.

 

Job Summary
We are seeking an SRE Team Lead to guide a squad of site reliability engineers responsible for the reliability of our highly distributed financial platform.You will be designing high-level reliability architecture, while also mentoring engineers, defining the technical roadmap, and driving the culture of Site Reliability Engineering within a team. You will balance strategic leadership with deep technical work to ensure our systems and our people can scale linearly with our hyper-growth.

 

Responsibilities

  • Set the technical direction for the SRE team. Architect self-healing systems, define reliability standards (Production Readiness Reviews), and drive the adoption of observability as Code and automation best practices.

  • Define and enforce the end-to-end standard for system visibility. You will guide teams to deeply instrument their code (logging, tracing, metrics) and govern the monitoring ecosystem to ensure alerts are actionable, strictly minimizing noise (alert fatigue) while maximizing our ability to detect and resolve issues proactively.

  • Lead, mentor, and grow a team of Senior and Associate SREs. Conduct code reviews, facilitate technical workshops and foster a culture of engineering excellence.

  • Act as the ultimate escalation point for major incidents. Beyond firefighting, you will refine the Incident management process, ensuring the process is efficient and that RCAs lead to actionable engineering fixes.

  •  Partner with Engineering Managers and Product Leads to define Service Level Objectives (SLOs) that align with business goals. 

Requirements

  • Minimum of 5 years of experience in SRE or Backend Engineering, with at least 2 years in a Lead or Senior/Staff role mentoring others.

  • Expert-level proficiency in Java, Go, Rust, or Python. You set the standard for code quality within the team.

  • Mastery of distributed systems patterns. You can design scalable architectures, debug complex microservices interactions, and explain architectural trade-offs to stakeholders.

  • Deep expertise with Google Cloud Platform (GCP) or AWS.  You have extensive experience running Kubernetes (GKE) at scale and troubleshooting deep infrastructure issues.

  • Proven experience defining observability strategies for large teams. You have deep expertise in architecting the complete telemetry stack: from custom instrumentation to monitoring and actionable alerting.

  • Strong communication skills with the ability to de-escalate high-pressure war rooms with calm authority.

 

What we can offer you

  • Culture - We put our people first and prioritize the well-being of every team member. We’ve built a company where all opinions carry weight and where all voices are heard. We value and respect each other and always look out for one another. Above all, we are human.
  • Learning - We have a learning and development-focused environment with an emphasis on knowledge sharing, training, and regular internal technical talks.
  • Compensation - You’ll receive an attractive salary, pension, health insurance, annual bonus, plus other benefits.

What to expect in the hiring process

  • A preliminary phone call with the recruiter
  • A technical interview with the Hiring Manager
  • A behavioural and technical interview with a member of the Executive team. 

Moniepoint is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees and candidates.

 

Moniepoint Inc. is a leading financial technology company that provides a seamless platform for businesses, their employees and customers, to accept payments digitally, receive credit and access business management tools that enable them to grow with ease. We are the parent company of TeamApt Ltd and Moniepoint MFB and we support over 1,800,000 businesses to process $12 billion monthly through our digital payment acceptance channels. For our work in making digital payment accessible to businesses in emerging markets, our Nigerian subsidiary was awarded the National Inclusive Payment Initiative Award by the Central Bank of Nigeria. In 2022, CB insights recognised us as a top global fintech. We are backed by QED, British International Investment, FMO, and other leading global venture capital funds. Moniepoint Inc. is a fully remote tech company with a diverse workforce worldwide and is headquartered in London, with offices in the US, Nairobi and Lagos. Join us as a #DreamMaker to help power the dreams of businesses globally.

View all jobs
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job