Company Overview

300+ media companies as clients, $40+ billion in revenue processed, 25,000+ worldwide users

Operative is a revenue accelerant for media companies around the world. No other software company in the AdTech space brings a comparable depth of experience to creating truly innovative software that performs across all platforms, revenue models, and business units. We are a SaaS (Software as a Service) platform that helps clients manage advertisements in both the linear (TV) and digital spaces. We have been in the market for over two decades and have 1100+ employees across 12 offices around the globe. Operative is proud to play a pivotal role in the way advertising is bought, sold, and managed across the media industry.

Job Summary

As a Site Reliability Engineer at Operative, you will be at the center of our efforts to build and design scalable software solutions for our clients. You will be part of a team of SREs whose mission is to enable platform observability, automation, improvements to the deployment process, and infrastructure as code for our SaaS products running in AWS.

Your efforts will be critical to ensuring we follow best practices such as infrastructure as code, security as code, and deployment and maintenance automation at all stages of our SDLC. You will work closely with the software development, product, and support teams and take direction from engineering leadership and architecture. This role requires both people management skills and hands-on technical work.

Responsibilities

Collaborate with ProdOps teams and engineering stakeholders to understand their deliverables, and help manage staff to enable your stakeholders to achieve priority objectives and remove their blockers.
Use industry best practices for CD, site reliability, and cloud infrastructure deployment and management.
Lead and collaborate on projects within the SRE/DevOps space.
Be a thought leader as it relates to SRE/DevOps across the R&D organization.
Automate the maintenance of highly scalable, fault-tolerant solutions in AWS.
Assist with compliance, evidence gathering, and technical remediation for Operative's compliance and audit processes.
Meet KPIs and deliver on objectives that track and advance Operative's production operations maturity.
Act as an escalation point to assist engineers with debugging infrastructure and automation issues.
Ensure that sufficient monitoring and alerting are in place to help the broader engineering and support teams be more proactive in production support.
Help maintain and update live SaaS systems with 99.99% client uptime SLAs.
Work with the broader engineering and production teams to maintain 24x7x365 on-call support.
Work with awesome people on a daily basis.
Other duties as assigned.

Qualifications

Bachelor's degree in Computer Science or a related field required; the company is willing to accept experience, or a combination of education and experience, in lieu of a degree
5 years of combined experience in SRE, DevOps, software development, and systems and/or network administration at an organization supporting dozens to hundreds of applications and/or servers, required
AWS experience is a must
At least 5 years of experience supporting custom software in a production environment
Minimum 2 years of experience in deployment / configuration management using tools like Chef, Ansible, Puppet, Octopus, or Team Foundation Server; experience on automation projects is acceptable
Experience with Continuous Integration tools such as Jenkins or GitLab
Experience with automation/configuration management using CloudFormation, Terraform, Ansible, or equivalents
Proven experience getting a SaaS product organization to true continuous deployment
Prior work experience with Container and Container Management frameworks (e.g., Docker, Kubernetes, AWS ECS)
Prior experience implementing cloud solutions and cloud security paradigms
Good understanding of key aspects of cloud infrastructure (security, scale, cost, etc.) in comparison with on-prem
Experience with log collection and analysis, builds and performance monitoring/tuning of infrastructure
Familiarity with a wide variety of cloud services and open-source technologies is preferred
Experience with service-oriented architecture and/or microservices is a plus
Passion for speed and efficiency through automation and waste reduction, with a focus on quality, security, and metrics to drive continuous improvement
Must have in-depth experience managing Linux-based workloads
Excellent communication skills

Education, Certification and Experience

8+ years of relevant experience.
Bachelor’s or master’s degree in computer science or equivalent

Why join us?
- Operative is a technology-oriented product organization that believes in empowering its people
- We use the latest tech stack and empower our engineers to learn, work and ideate on new technologies available in the market
- We provide flexible work schedules and remote working to encourage work-life balance
- We are an equal opportunities employer and recruit based on experience and skill set
Please apply online and upload your CV.
“Operative is a merit-first, equal opportunity employer; diverse applications are encouraged.”
Operative cares about your privacy and protecting your data. By submitting an application for a position with Operative, you acknowledge that you have read the following and consent to how Operative treats your data: 1) the Candidate Privacy Policy available at https://www.operative.com/candidate-privacy-notice/ (or, if you are a candidate from Israel, the Candidate Privacy Notice (Israel), available at https://www.operative.com/candidate-privacy-notice-israel/), and 2) the Candidate Notice for Data Transfer and Retention available at https://www.operative.com/candidate-notice/.

Principal System Engineer
