Modulate

Machine Learning Operations Engineer

Massachusetts, U.S.

Full-Time

Hybrid

$150,000 – $200,000 per year

TLDR

Own and scale the production inference systems for machine learning at Modulate, ensuring reliability and efficiency as customer usage rapidly grows.

Modulate is the leader in conversational voice intelligence. We enable enterprises to deeply understand how people communicate and take timely action based on those insights. Our products help detect harm, prevent fraud, and build safer, more trusted online and real-world voice environments. We are building a Conversation Intelligence Platform: APIs, workflows, and applications that bring voice understanding to customers at enterprise scale.

We’re looking for a Machine Learning Operations Engineer to own and scale the production inference systems behind Modulate’s machine learning models. This role will focus on ensuring high availability, reliability, and efficiency of deployed models across our APIs and enterprise products as we rapidly grow in customer usage and model demand.

Your Impact

Own the reliability and performance of ML model inference systems in production
Ensure high availability of deployed models across APIs and enterprise products
Build systems to handle scaling, load variability, and production traffic growth
Reduce operational burden through better tooling, automation, and processes
Help define how Modulate runs ML systems at scale with reliability and efficiency

What You Will Do

Deploy, monitor, and maintain production machine learning inference systems
Oversee fleets of inference machines and ensure system health and performance
Design monitoring, alerting, and incident response systems for ML workloads
Participate in on-call rotations and lead incident response and debugging
Build systems and processes for scaling inference infrastructure under variable load
Improve reliability and observability of production ML services
Collaborate on infrastructure-as-code for production deployments
Support or contribute to GPU-based training and inference infrastructure
Work closely with ML and engineering teams to ensure smooth model deployments
(Optional growth area) Optimize model inference performance and latency

What We Are Looking For

Experience deploying and maintaining production software systems
Experience building monitoring and alerting systems for production environments
Experience with on-call rotations and incident response
Strong experience with AWS, Python, and Linux
Exposure to PyTorch or similar ML frameworks
Experience working with GPU-based applications and basic GPU tooling (drivers, runtime, monitoring)
Strong debugging and systems thinking skills
Ability to operate calmly in production incident environments

Nice to Have

Experience with ML model serving systems or dedicated model servers
Experience monitoring GPU performance for inference workloads
Experience optimizing machine learning model inference
Familiarity with audio or multimedia data (codecs, streaming, real-time systems)
Experience with infrastructure-as-code (e.g., Terraform, CloudFormation)

Benefits

Competitive salary + equity
Full health, dental, and vision coverage
Flexible PTO with strong culture of taking it
Weekly team lunches with dietary accommodations
Hybrid work with core in-office days and flexible remote options
Leadership and technical learning sessions
Career development and continued learning support
Up to 8 weeks work-from-anywhere policy
A deeply inclusive, human-centered culture

Pay Transparency

Modulate believes in transparency as a cornerstone of equity and trust. Compensation for this role is based on seniority, skills, and experience.

Salary: $150K-$200K
Equity: Offered
Other perks: HSA, FSA, 15 holidays, professional growth resources

About Modulate

Modulate is on a mission to make voice a force for good online. Our tools help communities thrive by proactively detecting toxic behavior, protecting user identity, and empowering safety teams. We’re trusted by leaders in gaming and beyond, and we’re growing fast.

We believe that great cultures don’t just happen. That’s why we’ve built a foundation of intentional systems: from bias-reducing hiring practices to transparent pay to tools that help teams collaborate across communication styles. At Modulate, we treat people like people, and we’re building technology that does the same.

Ready to join us? Apply here or reach out directly. We’re excited to meet you.

A quick note as you apply

Please apply through the website rather than emailing [email protected]
For application questions (“Your fit for the role,” “Your values/goals,” “Why Modulate?”), focus on relevant experience and motivations
Avoid including protected demographic information
Keep responses authentic and in your own voice

Modulate

Modulate builds innovative voice chat technology designed to combat toxic behavior in online communities. Their flagship product, ToxMod, delivers a safe and immersive experience for gamers, addressing a critical need in the gaming industry by fostering inclusive interactions. Focused on enhancing player engagement and well-being, Modulate serves game developers and platforms looking to create healthier online spaces.

Founded: 2017
Employees: 11-50
Industry: Internet Software & Services
Total raised: $36M
