Avaloq is hiring a

Site Reliability Engineer

Pune, India
Full-Time

We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on automation, Unix, PowerShell scripting, and Oracle database management. The ideal candidate will have extensive experience in building and maintaining PaaS environments on Oracle Cloud Infrastructure (OCI), leveraging automation tools, and scripting in Bash, Python, and PowerShell. Proficiency in Terraform for infrastructure automation is essential, along with expertise in Oracle databases, performance tuning, and cloud-native services. Experience with GitHub Enterprise and GitHub Actions for CI/CD automation is a key requirement for this role.

Responsibilities:

PaaS Environment Development and Management:
Design, build, and maintain Platform-as-a-Service (PaaS) environments on Oracle Cloud Infrastructure (OCI), ensuring scalability, reliability, and security for cloud applications.

Infrastructure Automation & Scripting:
Automate infrastructure operations using Bash, Python, and PowerShell scripting, along with Terraform to streamline Oracle Cloud resource provisioning and management. Ensure efficient automation of Oracle database tasks such as backups, scaling, and performance optimization.

Site Reliability Engineering (SRE):
Apply SRE principles to improve the reliability, availability, and performance of cloud services. Proactively identify and resolve system issues, and ensure systems are highly available by automating routine tasks such as monitoring, alerting, and incident response. Work on service-level objectives (SLOs), service-level indicators (SLIs), and ensure error budgets are adhered to.

Unix Systems & Oracle Database Management:
Work extensively with Unix-based systems for server management, and implement best practices for Oracle database performance tuning, high availability, and backup automation.

Oracle Cloud Platform Engineering:
Leverage Oracle Cloud Infrastructure (OCI) services for robust and scalable cloud environments, managing infrastructure and application deployment.

DevOps and CI/CD Pipelines with GitHub Enterprise & GitHub Actions:
Implement DevOps practices and build CI/CD pipelines using GitHub Enterprise and GitHub Actions to automate deployment, testing, and monitoring processes. Ensure consistent and repeatable system updates with a focus on speed and efficiency.

Monitoring, Alerting, and Incident Management:
Integrate systems with monitoring tools like Prometheus and visualization platforms like Grafana to set up alerts, monitor performance, and reduce downtime. Automate incident management and resolution to meet availability targets.

Security and Compliance:
Implement security best practices and ensure compliance with industry standards for data protection and regulatory adherence within Oracle Cloud infrastructure.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field with total IT experience > 8 years
  • Proven experience as a Cloud Engineer or Site Reliability Engineer (SRE) with minimum 3 years of experience and focus on Oracle Cloud Infrastructure (OCI) (Other cloud also fine).
  • Strong expertise in automation, Unix-based systems, and scripting using Bash, Python, and PowerShell.
  • Proficiency in infrastructure automation using Terraform for provisioning and managing Oracle Cloud resources.
  • Hands-on experience with GitHub Enterprise and GitHub Actions for building and maintaining CI/CD pipelines.
  • Extensive knowledge of SRE principles, including monitoring, alerting, incident response, and performance optimization.
  • Experience with Oracle database management, including performance tuning, backups, and high availability.
  • Familiarity with PaaS environment development and management on Oracle Cloud.
  • Expertise with monitoring and observability tools such as Prometheus and Grafana for performance monitoring and analysis.
  • Strong problem-solving skills, with the ability to proactively identify and resolve system issues.
  • Excellent communication and collaboration abilities, with a proven track record of working effectively in a team environment.

Nice to Have:

  • Oracle Cloud Architect Certification or relevant cloud certifications.
  • Experience in writing APIS in python or Go
  • Experience in docker & kubernatis 

We realize that managing work life balance is a challenge we all face in our daily lives and in order to support with this we are pleased to offer hybrid and flexible working for most of our Avaloqers to maintain work life balance and still continue our fantastic Avaloq culture in our global offices. 

In Avaloq we are proud to embrace diversity and understand the success of our business is built on the power of different opinions, we are whole heartedly committed to fostering an equal opportunity environment and inclusive culture where you can be your true authentic self. 

We hire, compensate and promote regardless of origin, age, gender identity, sexual orientation or any other fantastic traits that make us all unique, we have done our best to write this advert in an inclusive and neutral way. 

Please be aware that we will not accept speculative CV submissions for any of our roles from recruitment agencies, and any unsolicited candidate submissions will be exempt from any payment expectations.  

 

#LI-Hybrid

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job