Salary: £72,145 - £90,182 DOE + Benefits + Bonus
Hythe, Kent
We care deeply about inclusive working practices and diverse teams. If you'd prefer to work part-time or as a job-share, we'll facilitate this wherever we can - whether to help you meet other commitments or to help you strike a great work-life balance.
About
We’re looking for a Senior Site Reliability Engineer who can play a pivotal role in driving our strategy by enhancing the dependability, scalability, and accessibility of our mission-critical systems and services.
You’ll drive positive change in implementing advanced SRE best practices, orchestrating proactive monitoring of system performance, and leveraging your extensive knowledge to rapidly diagnose and remediate incidents with precision.
The Role
By joining our team you’ll:
- Focus on delivering ultra-scalable and highly reliable software systems running on cloud infrastructure
- Advance the implementation of IaC principles to enable consistent and automated infrastructure provisioning and configuration
- Take ownership of incident resolution processes, conducting in-depth root cause analysis, and implementing preventive measures to avoid future incidents.
- Identify system weaknesses and work to continuously improve performance through metrics analysis and capacity planning.
- Mentor junior engineers, share best practices, and ensure high standards of documentation for infrastructure and incidents.
- Define and build our Service Level Objectives for critical services, ensuring they align with the business and user expectations.
- Make data-driven decisions for scaling resources up or down to meet demand through auto-scaling policies
- Collaborate to ensure the security of both hardware and cloud assets, applying patches and security updates are maintained.
- Continuously working to optimise system performance by analysing metrics, identifying bottlenecks, and implementing performance enhancements.
- Support our group of businesses and the nuances of their purpose-built infrastructures
- Lead and assist in solving challenges and delivering innovative new experiences with other teams
- Form part of our business out-of-hours support rota (1 in 6)
What you can bring to our team:
- Proven experience in building scalable and high-performance systems, employing cutting-edge technologies and processes.
- Robust knowledge and experience in the tools and technologies essential for system administration, software development and operational tasks.
- A track record of being solutions focussed and an excellent problem solver
- Experience with containerisation technologies (e.g., Docker, Kubernetes)
- In-depth knowledge of automated deployment and configuration management tools (e.g Scalr, GitHub Actions, CircleCI, RunDeck)
- Demonstrable knowledge of a programming language (e.g NodeJS/Typescript or Python) and shell scripting (BASH)
- Experience with advanced monitoring & observability tools (e.g Sumo Logic, Grafana, Looker, Prometheus)
- Have a solid understanding of network architecture principles and practises, firewalls (security groups), NACLs, routing and debugging problems
- A passion for and demonstrable interest in all that is cloud and highly scalable distributed web platforms
- A pragmatic approach understanding the importance of having a scalable and reliable infrastructure
- A constant interest in staying ahead of future tools and technologies
Everyone’s career path is individual and different, so this is just a guide. If your experience doesn't precisely match this, you’re encouraged to apply so that we can discover your unique talents!
How we hire for this role
We know your time is precious, so we keep our recruitment process as quick and easy as possible. If we believe you might be a match for a job you’ve applied for, you’ll enter our hiring process as follows:
- Initial 30-45 min call understanding more about your previous experience and explaining the team & role
- On-site interview including technical scenarios
Cultivating a diverse and inclusive culture is paramount for us. Recognising we are all different, if for whatever reason you need us to adapt the process, please get in touch via [email protected].
Closing date for applications: Sunday 13th October 2024
Why choose Holiday Extras?
We believe that holidays are the most precious time of all, so we create products, tech and services that make travel and holidays memorable and fun. We’re on a mission to Remake Holiday Making, turning our customers’ needs, wants and dreams into reality, for every stage of their trip.
At Holiday Extras we’re creating a workplace where everyone can thrive, build their careers and reach their limitless potential. By joining our team, you can enjoy a world of benefits to enhance your lifestyle and wellbeing, including 25 days plus your birthday off, profit share, enhanced parental leave, discounted gym memberships and more! Learn more about our culture and benefits.