People Can Fly is hiring a

Senior Site Reliability Engineer

Warszawa, Poland
Full-Time
  • Design, develop, deploy and operate reliable and scalable infrastructure for the online services platform
  • Collaborate with cross-functional teams to translate business requirements into technical solutions, balancing user needs with technical constraints.
  • Automate deployment of the online services platform to cloud providers, including provisioning for various stages like development, testing, and external publishers.
  • Develop and implement systems to maximise reliability, scalability, and uptime while also optimising for cost,
  • Design and develop systems and tooling that support efficient maintenance, updates, and recovery
  • Create tooling, data sources, monitoring dashboards, and alerting for all online services products, with a particular focus on real time service health
  • Lead Incident Management of live issues, as well as troubleshooting, break-fix and resolution of those issues
  • Create, review and maintain essential operational documentation such as run books, post-mortem reports, and root cause analysis 
  • Assist leads with recruiting, onboarding, development and mentorship of engineers.
  • Stay updated on emerging SRE technologies and industry trends, evaluating their potential impact on our development processes and strategies.
  • 4+ years of extensive experience in infrastructure engineering, with a specific focus on Cloud Infrastructure
  • Strong knowledge of, and experience with, writing and optimising Terraform.
  • Strong knowledge of, and experience with Infrastructure-as-Code (IaC) and related best practices
  • Strong in at least one programming language (Python, Go, Kotlin, Java or similar) as well as with scripting and automation in general
  • Good grasp of network architecture and security  best practices.
  • Familiarity with CI/CD pipelines and tools like Github Actions, Jenkins
  • Proficient with Source Control and Code Review tools (Git/Github, Perforce/Swarm etc.).
  • Experience setting up monitoring and alerting systems
  • Experience with Incident Management and troubleshooting live issues
  • Ability to analyse and improve system performance, strong troubleshooting skills across various technology layers.
  • Knowledge in designing and implementing disaster recovery strategies.
  • Strong mentoring skills.
  • Strong verbal and written communication skills in English.

Nice to have:

  • Experience in the Video Games Industry
  • Unreal Engine knowledge (C++ in particular)
  • Experience in content distribution, ad-tech, news, mobile gaming, or finance domains
  • Additional language proficiency
  • Additional project management and bug tracking software knowledge

What we offer:

  • Private medical healthcare including dental treatment for PCF members and their families (Signal Iduna).
  • MultiSport card for you and your family members or friends.
  • Free library with a wide range of games and books you have unlimited access to.
  • In-company Polish and English language classes.
  • Fresh fruit, snacks, and beverages for everyone in the office.
  • Flexible working hours.
  • Free virtual health and mental wellbeing sessions are included in the plan for members and their dependents.
  • Personal development opportunities and ability to work in a global environment.
  • Work in a creative team with people full of passion for what they do.

We are committed to an inclusive and diverse work culture. PCF is an equal opportunity employer. We do not discriminate based on race, color, ethnicity, ancestry, national origin, religion, sex, gender, gender identity, gender expression, sexual orientation, age, disability, genetic information, marital status or any legally protected status.

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Senior Site Reliability Engineer Q&A's
Report this job
Apply for this job