At the API Infrastructure team, our mission is to make our API platform the easiest to use by ensuring it is reliable, performant, and scalable. Internally, we serve as the vital connection between our customers and our cutting-edge models. We pride ourselves on creating the highest standards of infrastructure, which are critical for the operational excellence of our customers’ applications.
We are looking for an experienced engineering manager to support our API Infrastructure team. You will work closely with other teams in API and other infrastructure teams to meet our API platform’s strong scaling needs. Above all, you will be responsible for ensuring the foundation of our system can meet the evolving demands of our customers on system reliability and performance.
In this role, you will:
Manage, build out, and mentor a team of high performing backend and infrastructure engineers.
Collaborate closely with application and infrastructure teams to push to the boundaries of reliability, performance and scale of large language model API services.
Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think
You might thrive in this role if you:
Have 4+ years of experience in engineering management and 7+ years as an IC working with high scale distributed systems and ML systems
Have experience with highly available, reliable, production grade distributed systems at scale. Deep technical depth and expertise on scaling infrastructure systems during the growth phase
Care deeply about diversity, equity, and inclusion, and have a track record of building inclusive teams
Have experience closing extremely competitive candidates for your team, and the ability to craft and convey compelling visions of the future
Are comfortable with ambiguity and rapidly changing conditions. You view changes as an opportunity to add structure and order when necessary.
Extremely good collaborator.
Preferred
Direct experience working with Kubernetes
Experience working with Azure, or cloud infra (AWS, GCP)
Experience in drastically improving distributed systems’ reliability and latency
Prior experience working at a hyper growth company, and growing a team in that environment
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.
OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement
For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.