Site Reliability Engineer (SRE)

AI overview

Drive change in a small engineering team by expanding AWS deployments and improving the observability and reliability of services in a new cloud product.

Role overview

Join us in building our new cloud product in AWS. This is a role in a product Engineering team working on specialised cybersecurity application tooling: https://waratek.com/products

You will work closely with the Engineering team of four, helping them define IaC and observability, build a secure process, and maintain their applications. The focus of this role will be on defining the framework and toolbelt for Engineering to use. We currently have a SaaS and an on-prem offering; the SaaS platform is the main focus for this role. We don’t have a dedicated Ops or SRE team yet.

Responsibilities

  • Being a driving force for change in a small engineering organization.
  • Expanding and evolving our AWS deployments.
  • Working closely with Engineering on improving our processes, observability, and reliability of our services.
  • Building our SaaS to support the launch of our new IAST product.

Core experience / qualifications

  • 3+ years of experience in an SRE/DevOps role.
  • Practical experience with Java applications and ecosystem.
  • A good knowledge of AWS, containers and Kubernetes, IaC tools (CloudFormation, Terraform, Pulumi), and monitoring tools (Datadog, Grafana…).
  • Understanding of security best practices in software development.
  • Experience with a variety of testing approaches.
  • Excellent communication and collaboration skills.
  • Demonstrated ability to work on one's own initiative.
  • Proven ability to work well with other teams and roles.

Desirable experience / skills

  • Experience with relational and NoSQL databases such as MySQL, PostgreSQL, and Elasticsearch.
  • Experience with Oracle Cloud Infrastructure (OCI).
  • Exposure to software security and its tooling - e.g., GitHub Advanced Security, Snyk, Tenable, and others.
  • Experience with GitHub Actions.
  • Exposure to SOC 2 / ISO 27001 / GDPR technical requirements.
  • Experience designing, deploying, and managing Windows services.
  • Leadership experience.