We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on automation, Unix, PowerShell scripting, and Oracle database management. The ideal candidate will have extensive experience in building and maintaining PaaS environments on Oracle Cloud Infrastructure (OCI), leveraging automation tools, and scripting in Bash, Python, and PowerShell. Proficiency in Terraform for infrastructure automation is essential, along with expertise in Oracle databases, performance tuning, and cloud-native services. Experience with GitHub Enterprise and GitHub Actions for CI/CD automation is a key requirement for this role.
Responsibilities:
PaaS Environment Development and Management:
Design, build, and maintain Platform-as-a-Service (PaaS) environments on Oracle Cloud Infrastructure (OCI), ensuring scalability, reliability, and security for cloud applications.
Infrastructure Automation & Scripting:
Automate infrastructure operations using Bash, Python, and PowerShell scripting, along with Terraform to streamline Oracle Cloud resource provisioning and management. Ensure efficient automation of Oracle database tasks such as backups, scaling, and performance optimization.
Site Reliability Engineering (SRE):
Apply SRE principles to improve the reliability, availability, and performance of cloud services. Proactively identify and resolve system issues, and ensure systems are highly available by automating routine tasks such as monitoring, alerting, and incident response. Work on service-level objectives (SLOs) and service-level indicators (SLIs), and ensure error budgets are adhered to.
Unix Systems & Oracle Database Management:
Work extensively with Unix-based systems for server management, and implement best practices for Oracle database performance tuning, high availability, and backup automation.
Oracle Cloud Platform Engineering:
Leverage Oracle Cloud Infrastructure (OCI) services for robust and scalable cloud environments, managing infrastructure and application deployment.
DevOps and CI/CD Pipelines with GitHub Enterprise & GitHub Actions:
Implement DevOps practices and build CI/CD pipelines using GitHub Enterprise and GitHub Actions to automate deployment, testing, and monitoring processes. Ensure consistent and repeatable system updates with a focus on speed and efficiency.
Monitoring, Alerting, and Incident Management:
Integrate systems with monitoring tools like Prometheus and visualization platforms like Grafana to set up alerts, monitor performance, and reduce downtime. Automate incident management and resolution to meet availability targets.
Security and Compliance:
Implement security best practices and ensure compliance with industry standards for data protection and regulatory adherence within Oracle Cloud infrastructure.
Qualifications:
Nice to Have:
We realize that managing work-life balance is a challenge we all face in our daily lives. To support this, we are pleased to offer hybrid and flexible working for most of our Avaloqers, so that they can maintain work-life balance while continuing our fantastic Avaloq culture in our global offices.
At Avaloq we are proud to embrace diversity and understand that the success of our business is built on the power of different opinions. We are wholeheartedly committed to fostering an equal opportunity environment and an inclusive culture where you can be your true authentic self.
We hire, compensate and promote regardless of origin, age, gender identity, sexual orientation or any other fantastic traits that make us all unique. We have done our best to write this advert in an inclusive and neutral way.
Please be aware that we will not accept speculative CV submissions for any of our roles from recruitment agencies, and any unsolicited candidate submissions will be exempt from any payment expectations.