Working Student – GenAI / LLM Evaluation – Agentic AI / NLP (f/m/d)

TLDR

Contribute to evaluation and validation of GenAI systems in automotive applications by building datasets and enhancing testing frameworks.

Position Description

As a Working Student in the GenAI / LLM team at Cinemo, you will support the evaluation and validation of agentic AI systems and GenAI algorithms for NLP that power next-generation in-car experiences. You will help build datasets, extend evaluation tooling, and contribute to end-to-end testing workflows to ensure our non-deterministic AI components are measurable, reliable, and ready for real-world automotive environments across cloud-based services and in-vehicle platforms such as Android Automotive OS (AAOS) and Linux.

In this role, you will:

  • Support evaluation of agentic AI systems and LLM-based NLP features, including qualitative and quantitative analysis.

  • Create, curate, and maintain datasets for benchmarking, regression testing, and scenario coverage.

  • Extend and improve internal evaluation frameworks (metrics, dashboards, automated test runs).

  • Contribute to end-to-end testing of GenAI features within the in-car experience, including integration and validation workflows.

  • Document findings, track model/system changes, and communicate results clearly to the team.

  • Collaborate with engineers and researchers to translate evaluation insights into actionable improvements.

What you will need to succeed:

  • Ongoing Bachelor’s or Master’s studies in Computer Science, AI/ML, Data Science, Computational Linguistics, or a related field.

  • Hands-on programming skills in Python and a solid understanding of basic ML/NLP concepts.

  • Interest in GenAI / LLMs, agentic systems, and evaluation of non-deterministic AI behavior.

  • Experience with data handling and dataset creation (labeling, preprocessing, quality checks).

  • Familiarity with software testing concepts (e.g., unit/e2e testing, CI) is a plus.

  • Good written and spoken English communication skills.

  • The successful candidate will be based in Karlsruhe, Germany.

Cinemo GmbH builds cloud-based SaaS and IoT solutions that enhance user experiences across multi-platform ecosystems. Targeting both consumers and partners, the company leverages machine learning, data science, and advanced analytics to advance organizational capabilities and drive customer engagement.

View all jobs
Report this job
Apply for this job