Staff Engineer, Machine Learning

AI overview

Design and build core backend services for AI/ML runtime while optimizing performance, reliability, and cost in production systems.

REQUIREMENTS:

  • Total experience of 6 years+
  • Strong expertise in Python and backend engineering with experience building scalable, distributed microservices.
  • Hands-on experience designing and delivering end-to-end RAG (Retrieval-Augmented Generation) workflows in production systems.
  • Solid understanding of ML solution design, including embeddings, retrieval, ranking, feature engineering, and evaluation strategies.
  • Experience with vector databases (FAISS, Pinecone, Milvus, Weaviate) and implementing chunking, indexing, vector search, re-ranking, caching, and memory patterns.
  • Knowledge of LLM/NLP engineering, including prompt engineering, model integration, orchestration tools (LangChain/LlamaIndex), and evaluation instrumentation.
  • Experience productionizing ML systems with observability, online/offline parity, and performance optimization across latency, throughput, and cost.
  • Strong backend integration skills using REST/gRPC APIs, Docker, Kubernetes, CI/CD, and cloud platforms (AWS/GCP/Azure).
  • Ability to independently design, ship, and operate reliable, scalable, and cost-efficient ML-backed backend systems with strong ownership mindset.

RESPONSIBILITIES:

  • Design and build core backend services powering AI/ML runtime including orchestration, session/state management, and tools/services integration.
  • Implement end-to-end retrieval and memory systems covering ingestion, embeddings, indexing, vector search, ranking, caching, and lifecycle management.
  • Productionize ML workflows with feature/metadata services, model integration contracts, and evaluation hooks.
  • Drive performance, reliability, and cost optimization with strong SLO ownership and observability practices (logs, metrics, tracing, guardrails).
  • Collaborate with applied ML teams on model routing, prompts/tools, evaluation datasets, and safe releases.
  • Translate business requirements into scalable technical designs, define NFR benchmarks, and review architecture for extensibility and best practices.
  • Lead troubleshooting, root-cause analysis, and POCs to validate technology and design decisions.

 

Bachelor’s or master’s degree in computer science, Information Technology, or a related field.

👋🏼 We're Nagarro.We are a digital product engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (19,500+ experts across 36 countries, to be exact). Our work culture is dynamic and non-hierarchical. We're looking for great new colleagues. That's where you come in!By this point in your career, it is not just about the tech you know or how well you can code. It is about what more you want to do with that knowledge. Can you help your teammates proceed in the right direction? Can you tackle the challenges our clients face while always looking to take our solutions one step further to succeed at an even higher level? Yes? You may be ready to join us.

View all jobs
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Machine Learning Engineer Q&A's
Report this job
Apply for this job