Site Reliability Engineer (SRE) - Media Production Infrastructure

Cupertino, United States

Join a dedicated team of on-site SREs to support a world-class media production environment, focusing on high availability and continuous infrastructure improvement.

Please note that we will never request payment or bank account information at any stage of the recruitment process. As we continue to grow our teams, we urge you to be cautious of fraudulent job postings or recruitment activities that misuse our company name and information. Please protect your personal information during any recruitment process. While Monks may contact potential candidates via LinkedIn, all applications must be submitted through our official website (monks.com/careers).

About the Role

We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our Platform Engineering team, supporting a world-class media production environment for a leading global technology company. This is a crucial role within a Managed Services model, focused on ensuring the high availability, performance, and resilience of critical server, storage, and media workflow systems. You will be one of two dedicated on-site SREs who will partner with remote and consulting staff to provide around-the-clock operational support and continuous infrastructure improvement.

Key Responsibilities

  • Infrastructure Management: Maintain and troubleshoot all production hardware, servers, and storage infrastructure, with a specialized focus on the Storage Area Network (SAN).
  • Storage Expertise: Execute key maintenance and support for the SAN environment, including firmware/software updates for fiber switches, RAIDs, and tape systems.
  • Networking and System Admin: Manage directory services and network services (DNS, static IPs, subnet masks), and configure shares and permissions on the SAN.
  • Monitoring and Observability: Manage and improve custom dashboards for 24/7 monitoring of systems, RAIDs, temperature sensors, and backup/archive processes.
  • Custom Application Support: Contribute to the development and maintenance of custom applications and dashboards that support media workflows, including tools for project deployment, directory services integration, and ticketing.
  • Remote/On-Demand Support: Provide active on-site support and participate in a 24/7 on-call rotation for critical interventions (e.g., power/cooling issues).
  • Backup and Archive: Manage the Backup and Archive environment, maintain tape systems, and prepare projects for archiving to the cloud.

Qualifications & Experience

  • Experience: 14+ years of experience working with macOS and SAN environments, preferably Xsan.
  • Experience working with StorNext and Jamf.
  • Technical Depth:
    • Deep expertise in Fibre Channel networking.
    • Demonstrated experience with hardware RAIDs, block storage, and LUN creation.
    • Thorough knowledge of macOS ACLs, POSIX permissions, and Directory Services.
    • Expertise in installing and configuring Prometheus and Grafana, including creating Prometheus exporters.
  • Software & Scripting:
    • Experience with shell scripting.
    • Experience with remote connection technologies.
    • Thorough knowledge of data management for media and entertainment.

 

Please note: This position requires on-site presence in SCV/Cupertino three days per week, including Saturday and Sunday. The third on-site day may be scheduled on Tuesday, Wednesday, or Thursday. The remaining two days may be worked remotely.

What We Offer

Monks has provided a compensation range that represents its good faith estimate of what Monks may pay for the position at the time of posting. Monks may ultimately pay more or less than the posted compensation range. The salary offered to the selected candidate will be determined based on job-related factors, but not based on a candidate’s sex or any other protected status.

Salary range

$133,298.00 - $150,925.00 USD

About Monks

Monks is the global, purely digital, unitary operating brand of S4Capital plc. With a legacy of innovation and specialized expertise, Monks combines an extraordinary range of global marketing and technology services to accelerate business possibilities and redefine how brands and businesses interact with the world. Its integration of systems and workflows delivers unfettered content production, scaled experiences, enterprise-grade technology and data science fueled by AI—managed by the industry’s best and most diverse digital talent—to help the world’s trailblazing companies outmaneuver and outpace their competition.

Monks was named a Contender in The Forrester Wave™: Global Marketing Services. It has remained a constant presence on Adweek’s Fastest Growing lists (2019-23), ranks among Cannes Lions' Top 10 Creative Companies (2022-23) and is the only partner to have been placed in AdExchanger’s Programmatic Power Players list every year (2020-24). In addition to being named Adweek’s first AI Agency of the Year (2023), Monks has been recognized by Business Intelligence in its 2024 Excellence in Artificial Intelligence Awards program in three categories: the Individual category, Organizational Winner in AI Strategic Planning and AI Product for its service Monks.Flow. Monks has also garnered the title of Webby Production Company of the Year (2021-24), won a record number of FWAs and has earned a spot on Newsweek’s Top 100 Global Most Loved Workplaces 2023.

 

We are an equal-opportunity employer committed to building a respectful and empowering work environment for all people to freely express themselves amongst colleagues who embrace diversity in all respects. Including fresh voices and unique points of view in all aspects of our business not only creates an environment where we can all grow and thrive but also increases our potential to produce work that better represents—and resonates with—the world around us. 

