Software Development Engineer Voice

AI overview

Contribute to building a distributed real-time communications platform by integrating AI models and developing backend systems, impacting global products handling over 1 billion API requests monthly.
Plivo is a leading technology company transforming customer engagement for some of the world’s largest B2C brands, including Uber, WhatsApp, and Zomato. Our new product - the AI agents platform, automates the entire customer lifecycle - from acquiring, engaging, and supporting customers - through cutting-edge multimodal AI, including LLMs, text-to-speech, and speech detection. With a 100+ member team based out of India & US. We are building high-impact global products that handle over 1 billion API requests per month. If you are excited about solving hard, real-world AI challenges at scale, this is where you belong Role overview This is a deep systems and multidisciplinary role that bridges real-time communications (RTC), VoIP infrastructure, backend systems, and AI model development. You’ll architect and build a distributed RTC platform across the globe, develop backend services, and integrate AI models into production-grade voice and multimodal experiences at scale. If you love low-latency systems, real-time voice engineering, and AI-driven innovation, this is where you belong. What You’ll Do
  • Design and build real-time voice systems using WebRTC, SIP/RTP and Websocket streaming.
  • Engineer backend infrastructure for signaling, routing, call control, and audio/video processing.
  • Work with open-source RTC stacks -- Freeswitch, Kamailio, Livekit, RTPEngine, and Pipecat.
  • Develop and integrate AI capabilities, including: TTS (Text-to-Speech), STT (Speech-to-Text), VAD (Voice Activity Detection), Media servers and AI voice agentsBuild and scale a global, distributed RTC platform with strong resilience, observability, and low latency.I ntegrate AI/ML models into real-time voice systems (speech recognition, synthesis, embeddings).
  • Build and scale a global, distributed RTC platform with strong resilience, observability, and low latency.
  • Work across the stack -- from C/Go/Rust real-time components to Python/Node.js backend services, and our SDKs.
  • Collaborate cross-functionally with Product, and DevOps teams.
  • Instrument and monitor systems for quality, latency, and performance.
  • Prototype rapidly: build, test, iterate, and deploy new RTC + AI features.
  • Contribute to open-source voice and AI ecosystems.
  • Be hands-on: Debug issues, tune queries, optimize performance, and improve resiliency, you own your code from dev to prod.
  • Don’t be afraid to jump on a call or chat with a customer to ensure they have a smooth experience -- you own the outcome, not just the code.
  • Use AI-assisted development tools to improve coding speed, testing, and code quality.
  • What You Bring
  • Strong foundation in systems programming -- C, Go, and/or Rust.
  • Experience in backend and real-time systems engineering.
  • Expertise in WebRTC, SIP, VoIP, and signaling/audio pipelines.
  • Hands-on with open-source RTC stacks: Freeswitch, Kamailio, Livekit, RTPEngine, Pipecat
  • Understanding of media negotiation, codec pipelines, and audio/video streaming.
  • Knowledge of real-time networking (UDP/TCP, ICE, NAT traversal).Experience building and scaling distributed RTC platforms.
  • Experience with AI voice systems -- TTS, STT, VAD, LLM voice agents, or speech embeddings.
  • Familiarity with AI/ML frameworks (PyTorch, TensorFlow, ONNX) or model integration.
  • Backend development experience in Python or Node.js.
  • Strong debugging, profiling, and performance optimization skills.
  • Builder mindset: proactive, curious, and thrives in complex systems.
  • Bonus Points
  • Contributions to open-source RTC or AI projects.
  • Familiarity with LLM integration and multimodal AI (voice + text).Experience in edge computing or real-time streaming optimization.
  • Exposure to audio signal processing or DSP algorithms.
  • Experience deploying real-time systems on cloud (AWS/GCP) with Docker/Kubernetes.
  • Experience with AI voice agents, or voicebots.
  • Voice & SMS API Platform: Plivo enables businesses and developers to tap into powerful Voice and SMS capabilities without carrier lock-in.

    View all jobs
    Get hired quicker

    Be the first to apply. Receive an email whenever similar jobs are posted.

    Ace your job interview

    Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

    Software Development Engineer Q&A's
    Report this job
    Apply for this job