AI Developer - Voice & Multimodal AI

Work at the cutting edge of LLMs and voice AI to directly impact millions of users by powering AI agents for hiring and career growth, while collaborating in a high-speed, innovative environment.

About Apna:

Apna is India’s largest jobs and professional networking platform for frontline workers. We’re building the infrastructure to power hiring, skill-building, and career growth for 300 million+ working Indians. As we expand our AI-first platform across voice, text, and multimodal workflows, we’re looking for a bold and curious AI Data Scientist who wants to shape the future of applied Gen AI.

Requirement: 1

Location: Bengaluru (Work from Office - Domlur)

Team: AI & Machine Learning

Experience: 3–5 years

Requirements

What You'll Do:

  • Fine-tune and deploy LLMs, TTS, STT, and voice models for use in real-time conversations with millions of users.
  • Convert unstructured, messy real-world audio/text data into clean, high-quality datasets for training and evaluation.
  • Build inference pipelines optimized for low-latency, high-accuracy voice agents and multimodal interfaces.
  • Work closely with infra and product teams to ship production-grade GenAI models with observability, fallback, and monitoring.
  • Experiment with GANs, diffusion models, audio generation, and multimodal fusion to power next-gen AI agents.
  • Own the full model lifecycle — from research and training to deployment, testing, and iteration.

What We're Looking For:

  • 3–5 years of hands-on experience in AI / ML roles, ideally in startups or product-driven teams.
  • Strong grasp of LLM fine-tuning, instruction tuning, or pretraining techniques.
  • Familiarity with TTS/STT systems such as Whisper, Tacotron, and VITS, or commercial tools like ElevenLabs.
  • Experience with multimodal architectures, generative audio, GANs, or diffusion-based models.
  • Ability to work with real-world messy data, design training pipelines, and debug model failure modes.
  • Fluency in frameworks like PyTorch, Hugging Face, and TensorFlow, plus ecosystem tools (ONNX, Triton, LangChain, etc.).
  • Passion for building high-impact AI features that ship to real customers.

Benefits

Why Join Us:

  • Work at the cutting edge of LLMs, voice AI, and generative models — and ship real products, not just prototypes.
  • Directly impact millions of users by powering AI agents that help with hiring, learning, and career growth.
  • Collaborate with a world-class team of AI engineers, researchers, and product minds who move fast and ship boldly.
  • Freedom to explore: Own experiments, propose architecture, or contribute to foundational model training.
  • Startup speed, enterprise scale — best of both worlds. Rapid iteration and direct customer feedback.
  • Multilingual, India-first problems that push the boundaries of speech, reasoning, and personalization.

Founded in 2019, the Apna mobile app is India’s largest professional networking platform, dedicated to helping India’s burgeoning working class unlock unique professional networking and skilling opportunities. The app is currently live in 14 cities: Mumbai, Delhi-NCR, Bengaluru, Hyderabad, Pune, Ahmedabad, Jaipur, Ranchi, Kolkata, Surat, Lucknow, Kanpur, Ludhiana, and Chandigarh. Having raised $90+ million from marquee investors like Insight Partners, Tiger Global, Lightspeed India, Sequoia Capital, Rocketship.vc, and Greenoaks Capital, Apna is on a mission to enable livelihoods for billions in India. With over 10 million users, a presence in 14 cities and counting, and over 100,000 employers that trust the platform, Apna gives India a new destination to discover relevant opportunities.
