Own and lead the lifecycle of RL environment projects, ensuring high-quality delivery while directly influencing revenue and client relationships with leading AI labs.
About Turing
Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises looking to deploy advanced AI systems. Turing accelerates frontier research with high-quality data, specialized talent, and training pipelines that advance thinking, reasoning, coding, multimodality, and STEM. For enterprises, Turing builds proprietary intelligence systems that integrate AI into mission-critical workflows, unlock transformative outcomes, and drive lasting competitive advantage.
Recognized by Forbes, The Information, and Fast Company among the world’s top innovators, Turing’s leadership team includes AI technologists from Meta, Google, Microsoft, Apple, Amazon, McKinsey, Bain, Stanford, Caltech, and MIT. Learn more at www.turing.com
Turing builds large-scale datasets and reinforcement learning (RL) environments that power post-training for the world’s leading AI labs and enterprises, including OpenAI, Anthropic, Google DeepMind, Microsoft AI, Amazon, Apple, and many more. We create RL environments to evaluate and improve our customers' models on complex, long-range, multi-step workflows across high-GDP-value domains such as Finance, Sales, Retail, Developer Tools, Collaboration, Customer Experience.
The environments vary depending on the model capability being evaluated / improved, a few examples of environment types are listed here:
We are looking for a Frontier Data Lead – RL to own the end-to-end lifecycle of RL environment projects, spanning environment design, task generation, reward/verifier design, quality, and delivery to frontier AI labs and enterprise clients.
This is a hands-on technical leadership role where you influence revenue directly – you will be mapped to one or more AI labs and build RL environments specific to their needs. You will lead teams of engineers, subject matter experts (e.g. Finance expert, if you’re building an RL environment for investment banking workflows), researchers, and data ops teammates to achieve this.
What You’ll Do
Compensation: $250,000 to $350,000 OTE + Equity
Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. Turing is proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, disability, protected veteran status, or any other legally protected characteristics. At Turing we are dedicated to building a diverse, inclusive and authentic workplace and celebrate authenticity, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.
For applicants from the European Union, please review Turing's GDPR notice here.
Turing accelerates research for frontier AI labs by providing high-quality datasets and reinforcement learning environments. Serving global enterprises, Turing develops proprietary intelligence systems that seamlessly integrate AI into critical workflows to enhance performance and drive innovation.
Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!