We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our systems.
The ideal candidate will bring extensive knowledge of Kubernetes, containerization technologies, and cloud environments (e.g., AWS and/or GCP), along with a strong programming background. You should be passionate about automation, security, and upholding best practices in reliability engineering.
In this role, you will:
- Collaborate with cross-functional teams to enhance system reliability, performance, and scalability.
- Implement and maintain automation processes to streamline operations.
- Drive security initiatives and adhere to best practices in cybersecurity.
- Monitor and analyze system performance, identifying areas for improvement.
- Contribute to the development and maintenance of Infrastructure as Code (IaC) scripts.
- Participate in on-call rotation and incident response activities.
We are looking for people with:
- A minimum of 2 years of experience as an SRE, DevOps Engineer, or Software Engineer
- Strong knowledge of running Kubernetes at scale and of containerization technologies in general
- Strong programming ability in an object-oriented programming (OOP) language, plus proficiency in scripting
- Expertise in a cloud environment, particularly AWS and/or GCP
- Expert, up-to-date knowledge of security best practices
- Familiarity with SRE principles, including SLIs (Service Level Indicators), SLOs (Service Level Objectives), SLAs (Service Level Agreements), toil, uptime, and observability
- Solid understanding of networking fundamentals
- Commitment to standardization and documentation
- Experience with Linux
- Proficiency in software delivery automation, CI/CD (Continuous Integration/Continuous Deployment), and the SDLC (Software Development Life Cycle)
- Experience with metrics, monitoring, and alerting using tools such as Prometheus, Grafana, or similar
- Familiarity with IaC tools, including Ansible and Terraform
- Experience with Elasticsearch, Kafka, MySQL, Postgres, Windows, and Redis
What we offer:
- 25 vacation days, plus additional days based on age and number of children
- Cafeteria benefit via SZÉP card
- Medicover private health insurance for employees and their family members
- 100% paid sick leave
- Excellent hardware bundle
- Option to work fully remotely
- Full Calm subscription and a 24/7 Employee Assistance Program
- Informal, friendly, welcoming environment focused on people, learning & development
Please note that this position is open only to candidates currently residing in Hungary. Unfortunately, we are unable to consider applicants based outside the country.