Optimize systems that elevate educational access for all students
Are you ready to apply your technical expertise to make a significant impact in the educational sector? As a Senior Site Reliability Engineer at MasteryPrep, you'll ensure that our digital platforms are robust, scalable, and secure, providing a reliable learning environment for underserved students. Your skills will not only optimize our systems but also enhance the educational journey of thousands of students by ensuring uninterrupted access to critical learning resources.
In this role, you will work closely with the development and operations teams to design, implement, and maintain infrastructure on Google Cloud Platform (GCP), while also leading cybersecurity initiatives to protect our data and systems from threats. Your proactive approach to system reliability will prevent disruptions and maintain the high availability our users depend on. Join our mission-driven team, where your passion for technical excellence meets our commitment to making education accessible and impactful for all students.
Key Responsibilities:
- Design, implement, and manage cloud-native solutions using AWS, GCP, and other major cloud platforms, with a focus on scalability, reliability, and security.
- Automate infrastructure and deployment processes using Infrastructure as Code (IaC) tools such as Terraform and CloudFormation, ensuring efficient and consistent environments.
- Develop and maintain comprehensive monitoring, alerting, and logging systems to ensure system reliability and performance, proactively identifying and resolving potential issues.
- Serve as the primary cybersecurity resource for the company, staying abreast of the latest threats and implementing measures to protect our systems and data.
- Conduct regular security audits and vulnerability assessments to identify and address potential risks.
- Implement and maintain security controls, including access controls, encryption, and multi-factor authentication (MFA).
- Monitor system logs and network traffic for suspicious activity and respond to security incidents in a timely manner.
- Work closely with development teams to ensure that applications are developed and deployed securely, providing guidance on secure coding practices.
- Mentor junior team members and promote best practices in infrastructure automation and reliability engineering, fostering a culture of continuous improvement.
- Collaborate closely with development, operations, and product teams to optimize system performance and reliability, ensuring alignment with business objectives.
About MasteryPrep
Nearly 90% of low-income students graduate high school without a college-ready ACT or SAT score. MasteryPrep’s mission is to level the playing field in education by offering the most effective test preparation available – made accessible to all students.
Through more than 10 successful years of partnering with school districts and institutions on college readiness services and resources, MasteryPrep has surpassed one million students served since the company’s founding in 2012.
MasteryPrep increased its student outreach by 70 percent in 2021. The company has been ranked among the Inc. 5000 “Fastest Growing Companies,” featured by “Entrepreneur 360,” and selected among the “Growth Leaders” by Louisiana Economic Development.
Requirements
Required:
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
- 5+ years in a Site Reliability Engineering or DevOps role with a focus on cloud platforms (AWS, GCP, or Azure)
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, and CDK
- Proven experience as a Site Reliability Engineer or in a similar role, with a strong focus on cybersecurity
- Strong knowledge of Google Firestore, PostgreSQL, TypeScript, React, and Python
- Experience with Unix/Linux Administration
- Experience implementing and maintaining security controls on Google Cloud Platform (GCP), AWS, or Microsoft Azure
- Experience with monitoring and logging tools such as Prometheus, CloudWatch, Stackdriver, or Splunk
- Experience with containerization and orchestration technologies such as Docker and Kubernetes
- Strong scripting skills in languages such as Python, Bash, or PowerShell to enhance operational workflows
- Experience conducting security audits and vulnerability assessments
- Excellent problem-solving and troubleshooting skills
- Ability to work independently and as part of a team in a fast-paced environment
- Strong communication and collaboration skills
- Currently authorized to work in the United States
- *Note that US Visa sponsorship will not be provided*
Preferred:
- Certifications in AWS, GCP, or other cloud platforms
- Certification in cybersecurity (e.g., CISSP, CISM, CEH)
- Experience with security information and event management (SIEM) systems
- Familiarity with compliance standards such as FERPA, COPPA, or PCI DSS
- Knowledge of secure coding practices and application security testing techniques
- Experience with network security technologies such as firewalls, intrusion detection/prevention systems (IDS/IPS), and VPNs
- Currently located in the Eastern US time zone
Benefits
- $165,000 to $175,000 starting salary, based on qualifications
- Opportunity to work with cutting-edge technologies in a collaborative environment
- Flexible work hours and remote work environment
- Professional development opportunities and reimbursement for certifications and training
- Company-sponsored social events and team-building activities
- Employee benefits eligibility (health, disability, AD&D, life insurance)
- 401(k) with company match
- Paid time off
- Generous paid holidays