KnowBe4 is hiring a

Site Reliability Engineer (Internal Engineering) (Remote)

Remote

About KnowBe4

KnowBe4, the provider of the world's largest security awareness training and simulated phishing platform, is used by tens of thousands of organizations around the globe. KnowBe4 enables organizations to manage the ongoing problem of social engineering by helping them train employees to make smarter security decisions, every day.

Fortune has ranked us as a best place to work for women, for millennials, and in technology for four years in a row! We have been certified as a "Great Place To Work" in 8 countries, plus we've earned numerous other prestigious awards, including Glassdoor's Best Places To Work.

Our team values radical transparency, extreme ownership, and continuous professional development in a welcoming workplace that encourages all employees to be themselves. Whether working remotely or in-person, we strive to make every day fun and engaging; from team lunches to trivia competitions to local outings, there is always something exciting happening at KnowBe4.

The Internal SRE ensures the reliability, scalability, and performance of internal systems and infrastructure. This role involves monitoring, automation, incident management, and maintaining self-hosted platforms to support smooth development operations. The Internal SRE works closely with cross-functional teams to manage GitLab CI/CD workflows and cloud infrastructure on AWS. The position emphasizes proactive problem-solving, automation, and collaboration to continuously improve system stability and efficiency.

Responsibilities:

  • Manage and maintain GitLab environments to ensure high availability and security.
  • Design and implement CI/CD pipelines to automate software delivery.
  • Monitor and troubleshoot system performance issues, using observability tools like Prometheus, Grafana, or Datadog.
  • Collaborate with development teams to align infrastructure efforts with project needs and timelines.
  • Build and maintain infrastructure as code (IaC) solutions using tools like Terraform and Ansible.
  • Manage AWS services, including ECS, S3, API Gateway, DynamoDB, RDS, IAM, and VPC.
  • Participate in incident response, conducting root cause analysis and post-incident reviews.
  • Automate manual tasks to improve operational efficiency and reduce technical debt.

Minimum Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Equivalent work experience in SRE, DevOps, or infrastructure management may substitute for formal education.
  • GitLab Administration: Experience managing and securing self-hosted GitLab environments.
  • CI/CD Workflows: Expertise in designing and maintaining automated pipelines for continuous delivery.
  • AWS Cloud Expertise: Strong knowledge of AWS services, including ECS, S3, API Gateway, DynamoDB, RDS, IAM, VPC, and Lambda.
  • Infrastructure-as-Code: Proficiency in Terraform, Ansible, or similar tools.
  • Monitoring and Observability: Experience with Prometheus, Grafana, Datadog, or other observability platforms.
  • Automation and Scripting: Proficiency in Python, Bash, or other scripting languages to automate tasks.
  • Incident Management: Ability to lead incident response efforts and conduct root cause analysis.
  • Collaboration and Communication: Strong interpersonal skills to work effectively across teams and with stakeholders.

The base pay for this position ranges from $110,000 - $125,000, which will vary depending on how well an applicant's skills and experience align with the job description listed above.

We will accept applications until 2/18/2025.

Our Fantastic Benefits

We offer company-wide bonuses based on monthly sales targets, employee referral bonuses, adoption assistance, tuition reimbursement, certification reimbursement, certification completion bonuses, and a relaxed dress code - all in a modern, high-tech, and fun work environment. For more details about our benefits in each office location, please visit www.knowbe4.com/careers/benefits.

Note: An applicant assessment and background check may be part of your hiring procedure.

Individuals seeking employment at KnowBe4 are considered without prejudice to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, sexual orientation or any other characteristic protected under applicable federal, state, or local law. If you require reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please visit www.knowbe4.com/careers/request-accommodation.

No recruitment agencies, please.

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Site Reliability Engineer Q&A's
Report this job
Apply for this job