PolyAI is hiring a

Software Engineer - Model Evaluation and Productisation (Must be UK based)

London, United Kingdom
Full-Time

PolyAI is a leader in automating customer service through innovative voice technology. Our voice assistants empower businesses to deliver exceptional customer service at every interaction.

We are seeking a talented and hands-on Software Engineer with a strong data science background to join our team.

In this role, you will work on building software to enhance the visibility and configurability of large language models (LLMs). You will be responsible for rapidly developing tools and platforms to evaluate, iterate, and productionalize models, ensuring their reliability and accuracy.

We are looking for the right candidate, and therefore are flexible on the levelling for this position ranging from mid- level to senior!

Responsibilities:

  • Must have at least 2 years of Python experience
  • Must have at least 2+ years working experience
  • Develop software that provides visibility into LLM models and offers configurability for tuning and evaluation.
  • Build and maintain evaluation datasets and tools, enabling the measurement of model performance across key metrics.
  • Take a hands-on approach to quickly prototype, test, and iterate on solutions, moving from proof-of-concept to production.
  • Employ a data-driven methodology to drive model accuracy, leveraging evaluation results to inform decisions.
  • Collaborate with cross-functional teams to integrate developed tools and ensure they meet production standards.
  • Formulate hypotheses, design experiments, and collect data to validate model assumptions, consistently striving for improved reliability.
  • Communicate findings and ranking metrics clearly to both technical and non-technical stakeholders.

Why Join Us:

Join a dynamic and innovative team at the forefront of LLM development. You will have the opportunity to work on challenging projects, rapidly build impactful solutions, and drive data-informed improvements that push the boundaries of what LLMs can achieve.

Requirements

  • Degree in Computer Science, Data Science, or related field, or equivalent experience.
  • Strong proficiency in Python
  • Experience developing evaluation suites, datasets, and data-driven tools for model reliability testing.
  • Ability to rapidly prototype and iterate on solutions while maintaining a focus on production-level quality.
  • Strong problem-solving skills and a creative mindset, with the ability to hypothesize and validate results through experimentation.
  • Familiarity with cloud platforms such as AWS, GCP, or Azure is a plus.

Benefits

💰 Participation in the company’s employee share options plan

🏝 25 days holiday, plus bank holidays

🏡 Flexible working from home policy plus a one-off WFH allowance when you join

🌎 Work from outside of the UK for up to 6 months each year

🧡 Enhanced parental leave

📚 Yearly learning budget

🚲 Bike2Work scheme

📚 Annual learning and development allowance

🏡 One-off WFH allowance when you join

👨‍👩‍👧 Company-funded fertility and family-forming programmes

🌸 Menopause care programme with Maven

🏥 Private healthcare and dental cover, discounts on gym members and relaxation apps, and access to a range of mental health programs

Equal Opportunity Statement:

PolyAI is proud to be an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

All employment decisions at PolyAI will be based on the business needs without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.

Apply for this job

Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!

Get hired quicker

Be the first to apply. Receive an email whenever similar jobs are posted.

Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Software Engineer Q&A's
Report this job
Apply for this job