Manager Data Reliability Engineering

Hyderabad , India
full-time

AI overview

Design and manage cloud-based data systems to ensure reliability, scalability, and security while mentoring junior engineers in a culture that values innovation and inclusivity.
About Zeta: Build the future of banking. Zeta is a next-generation banking technology company providing cloud-native, fully stackable processing and core banking platforms for issuers. With a focus on scalability, compliance, and innovation, Zeta empowers financial institutions to modernize their technology infrastructure and deliver secure, seamless digital banking experiences.  Our impact runs at real-world scale. Today, over 25 million cards are live on Zeta-powered platforms across 7 countries, supported by a passionate team of 1,700+ Zetanauts across India, the US, EMEA, and Asia. Backed by SoftBank Vision Fund, Mastercard, and other reputed strategic investors, we reached a valuation of $2 billion in 2025. Our focus is on establishing product lines that focus on key outcomes by addressing real customer pain points, modernizing legacy systems, and strengthening core fundamentals. As a result, our systems and platforms support a wide range of banking and payments capabilities, including: 1. Tachyon, our cloud-native banking stack built for population-scale systems 2. Cipher, our unified authentication platform for secure, high-volume banking environments 3. Digital Credit as a Service, enabling banks to launch credit lines on UPI 4. Elena, our intelligent and conversational AI platform for banking. 5. Pixel, India’s first digital-native credit card, launched in partnership with HDFC Bank, for whom we also revamped their PayZapp mobile app: Winner of the Celent Model Bank Award for Payments Innovation 2024. 6. Sparrow, the leading card experience for non-prime cardholders in the US …and more across cards, payments, lending, and core banking. We are an engineering-first organization that values ownership, bias for action, and long-term thinking. Together, we solve some of the hardest problems in banking tech. Our culture is built around trust, collaboration, and creating the conditions for you to drive impact proportionate to your potential. Reinforcing our commitment to creating an inclusive and supportive workplace, we have been consistently recognized as a Great Place to Work. If you want to build cutting-edge banking tech that enables banks to serve millions reliably, securely, and at a population scale, Zeta is your playground.If you would like to learn more about how we have grown and evolved over the years, watch our journey here. You can also explore our website and follow us on LinkedIn, Instagram,YouTube, and X. About the Role: As a Lead Data Reliability Engineer, you will be accountable for designing, deploying, and managing our complex cloud-based data systems. You will lead efforts to ensure the reliability, scalability, and security of our data infrastructure, while mentoring and guiding junior team members. Responsibilities:
  • Leverage deep expertise in database management and optimization to ensure high performance and reliability of our data systems.
  • Identify bottlenecks and performance issues within data pipelines, optimize query performance, data access, and overall data processing.
  • Design, deploy, and manage complex data systems in cloud environments (e.g., AWS, Azure, GCP) using tools such as Terraform and adhering to AWS well architected Framework.
  • Develop and implement complex automation solutions using tools such as Jenkins and scripts to streamline data operations and enhance efficiency.
  • Architect and manage enterprise HA and DR solutions to ensure business continuity and data availability.
  • Expertly analyze and optimize database performance, identifying and resolving bottlenecks.
  • Ensure adherence to cloud security best practices and compliance standards, protecting sensitive data and systems.
  • Manage complex incidents, troubleshoot issues, and implement effective solutions to maintain data integrity and system reliability.
  • Demonstrate leadership skills, mentor junior team members, and foster a collaborative and communicative team environment.
  • Skills:
  • Leadership: Proven leadership skills with the ability to inspire and motivate a team.
  • Project Management: Proven experience in managing complex database projects.
  • Database Management Systems: Expertise in one or more relational database management systems.
  • Security and Compliance: In-depth knowledge of database security principles and compliance requirements.
  • Performance Tuning: Advanced skills in monitoring and optimizing database performance.
  • Team Collaboration: Effective collaboration with cross-functional teams and departments.
  • Vendor Management: Experience in engaging with database technology vendors.
  • Problem-Solving: Strong problem-solving skills, especially in resolving complex database issues.
  • Communication: Excellent communication skills for conveying technical information to both technical and non-technical stakeholders.
  • Strategic Planning: Ability to contribute to the development of the organization's overall database strategy.
  • Experience and Qualifications:
  • Bachelor’s degree in Computer Science or equivalent with 8 - 11 years of hands-on experience in database management and optimization on various relational databases with PostgreSQL as primary skillset.
  • Experience in Cloud Databases administration - Configuration, Backup/Restore, replication which are in the order of 10s of TB in size. 
  • Expertise in identifying and resolving performance issues within data pipelines.
  • Experience in developing and implementing complex automation solutions using tools like Jenkins, Terraform & Python.
  • Experience in architecting and managing high availability and disaster recovery solutions of Databases in the cloud across regions.
  • Expert in performance monitoring and optimization of database systems.
  • In-depth knowledge of cloud security best practices and compliance standards.
  • Extensive experience in managing complex incidents and troubleshooting.
  • Should be able to solve problems, make sound decisions and use good judgment. 
  • Leadership skills with the ability to mentor and guide junior team members.
  • Experience with modern data processing tools and frameworks (e.g., Apache Kafka, Apache Spark, Airflow, Debezium etc.) is a plus.
  • Zeta is an equal opportunity employer.  
    At Zeta, we are committed to equal employment opportunities regardless of job history, disability, gender identity, religion, race, marital/parental status, or another special status. We are proud to be an equitable workplace that welcomes individuals from all walks of life if they fit the roles and responsibilities.

    Zeta Optima is changing how corporates manage employee meal e vouchers and other digital tax saving benefits. All Optima grants can be used via app, card or tag.

    View all jobs
    Ace your job interview

    Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

    Reliability Engineer Q&A's
    Report this job
    Apply for this job