Senior Machine Learning Engineer

Zürich , Switzerland

Employment Type: Full-Time
Location: Zurich, on-site (hybrid)

We are seeking a highly skilled Senior Machine Learning Engineer to join our team, focusing on post-training and optimizing large language models (LLMs), their integration with retrieval-augmented generation (RAG), and multi-agent architectures. The candidate will play a crucial role in improving the core of our LLM-based solutions through applied research, innovative designs, solid implementation, systematic evaluation, and impactful optimizations. You will collaborate closely with machine learning engineers, software engineers, and domain experts to push the frontier of LLM for code generation in our products.

Key Responsibilities:

  • Design LLM training datasets for instruction following, question answering, reasoning, conversational models, and code generation to improve model accuracy, robustness, and generalization.
  • Lead post-training optimization efforts by integrating curated content into fine-tuning pipelines tailored to specific use cases.
  • Define processes for creating and validating high-quality LLM training data (code, text, prompts, dialogues, scenarios) and synthetic data generation.
  • Establish systematic evaluation and benchmarking frameworks to assess LLM performance.
  • Architect, refine, and evaluate engines for retrieval-augmented generation to enhance accuracy and speed of LLM integrations.
  • Develop and apply sophisticated multi-agent architectures and prompt engineering to optimize our LLM solutions.
  • Lead experiments and apply the latest research directions in LLMs, ensuring impactful implementation.
  • Collaborate cross-functionally with product managers, data scientists, and engineers to align solutions with business and technical goals.
  • Implement high-quality LLM components, tools, and integrations, accompanied by comprehensive specifications and documentation.

Required Qualifications:

  • Bachelor’s, Master’s, or PhD degree in Computer Science, Data Science, or a related field.
  • 2+ years of professional experience with large language models or machine learning frameworks, specifically in post-training, fine-tuning, benchmarking, and RAG.
  • Strong proficiency in Python and extensive experience with ML frameworks such as PyTorch or TensorFlow.
  • Proven ability to work effectively in cross-functional teams.
  • Strong analytical skills with a pragmatic approach to implementing impactful solutions.

Preferred Qualifications:

  • Extensive experience with LLM architectures like BERT, Llama-3, or similar.
  • Experience working with LLMs for code generation, particularly in low-resource programming languages.
  • Deep understanding of reinforcement learning from human feedback (RLHF) and reinforcement learning from verifiable rewards (RLVR).
  • Hands-on experience with tools for annotation, labeling, and dataset management.
  • Strong understanding of bias and fairness issues in AI and practical experience mitigating these issues.
  • Proficiency in TypeScript

 

About DFINITY and the Internet Computer:

DFINITY is a leading contributor to the Internet Computer Protocol (ICP), with a mission to bring the world's compute onto the secure ICP network. Built on its unique third-generation blockchain technology, ICP enables the development and operation of a new generation of unstoppable, tamper-proof, fully decentralized web applications. Its powerful technology can run entire AI models within smart contracts, representing a major advancement for secure AI. Through seamless integration with Bitcoin, Ethereum, and other networks, ICP facilitates multi-chain operations for digital assets and web3.

Join our team of over 250 talented individuals, including world-renowned cryptographers, distributed systems engineers, programming language experts, and industry leaders, who are shaping the future of the internet and web3.
 
DFINITY was founded in 2016 by entrepreneur and crypto theoretician, Dominic Williams.

All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

The DFINITY Foundation is a not-for-profit organisation developing the Internet Computer.

View all jobs
Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Senior Machine Learning Engineer Q&A's
Report this job
Apply for this job