Sonatype
Sonatype

Senior Data Scientist

TLDR

As an internal AI consultant, you'll help teams apply machine learning and generative AI to real-world problems, exploring complex datasets and designing scalable solutions.

Sonatype is the software supply chain security company. We provide the world’s best end-to-end software supply chain security solution, combining the only proactive protection against malicious open source, the only enterprise grade SBOM management and the leading open source dependency management platform. This empowers enterprises to create and maintain secure, quality, and innovative software at scale. As founders of Nexus Repository and stewards of Maven Central, the world’s largest repository of Java open-source software, we are software pioneers and our open source expertise is unmatched. We empower innovation with an unparalleled commitment to build faster, safer software and harness AI and data intelligence to mitigate risk, maximize efficiencies, and drive powerful software development. More than 2,000 organizations, including 70% of the Fortune 100 and 15 million software developers, rely on Sonatype to optimize their software supply chains. The Opportunity ●      We’re looking for a Senior Data Scientist to join our growing AI & Data Science team. ●      You’ll operate as an internal AI consultant and technical lead, helping multiple teams across Sonatype apply machine learning and generative AI to real-world problems. ●      You’ll explore complex datasets, design experiments, build models, and collaborate closely with product engineering, and security experts to turn research ideas into practical, scalable solutions. ●      This role is ideal for someone who thrives on autonomy, loves translating ambiguous ideas into working systems, and enjoys working across boundaries rather than in a single product lane. What You’ll Do
  • Lead applied AI projects from concept to impact — prototype, validate, and help teams deploy practical ML and GenAI solutions.
  • Collaborate cross-functionally: Partner with product, engineering, and research teams to scope problems, identify opportunities, and co-develop solutions.
  • Act as an internal consultant: Advise teams on ML/AI best practices, model evaluation, and productive use of generative technologies.
  • Design robust experiments and establish evaluation pipelines for model reliability, accuracy, and business impact.
  • Bridge research and production: Package research insights into usable APIs, tools, or workflows for other teams.
  • Explore new techniques (e.g., LLMs, embeddings models, retrieval-augmented generation, agentic workflows) to enhance developer and security experiences.
  • Share knowledge and mentor peers, helping elevate the organization’s AI literacy and capabilities.
  • What We’re Looking For
  • 6+ years of experience in applied data science, machine learning, or AI research
  • Strong Python skills and hands-on experience with ML/AI libraries and platforms such as Databricks, OpenAI API, and Scikit-learn
  • Comfortable working with large, messy, or unstructured datasets — you know how to turn chaos into features, insights, and beautiful visualizations
  • Deep familiarity with LLMs and GenAI ecosystems (e.g. OpenAI, Claude, Hugging Face): skilled in prompt engineering, parameter tuning, and evaluating model behavior against ground truth
  • Experience taking ML or GenAI systems from prototype to production, even if small-scale or incremental
  • Strong analytical thinking, experimentation skills, and appreciation for trustworthy, data-driven evaluation
  • Proficiency with Git and collaborative code workflows (GitHub or similar)
  • A balanced mindset — equally comfortable exploring research ideas and implementing production-ready systems
  • Proactive and self-directed: you don’t wait for perfect specs; you find meaningful problems and drive them to completion
  • Bonus Points
  • Experience with AI-assisted coding tools (Copilot, Claude Code, Codex, etc.)
  • Familiarity with agentic workflows, Model Context Protocol (MCP), and tool-use integrations
  • Exposure to cybersecurity, anomaly detection, or code analysis
  • Understanding of MLOps practices (MLflow, AWS SageMaker, model serving, or monitoring)
  • Things we are proud of
     
    • 2025 AI Compliance Solution of the Year - AI Breakthrough Awards

    • 2025 DEVIES Award to our SBOM Manager new product for its innovation and impact in developer technology

    • 2024 Industry Leader in Forrester-Wave for Software Composition Analysis (2024 Q4 report)

    • 2023 Fast Company Best Places for Innovators

    • 2023 Gartner's Magic Quadrant

    • 2023 Software Report's Top 100 Software Companies

    • 2023 BuiltIn Best Places to Work

    • 2022 Frost & Sullivan Technology Innovation Leader Award

    • 2022 PeerSpot Silver Peer Award in Software Composition Analysis

    • 2022 Tech Ascension Best DevOps Security Solution Award

    • 2022 NVCT Cyber Company of the Year

    • Company Wellness Week - We shut down company operations for a week to enable all employees to pursue personal growth and enjoy a much-needed and deserved rest. 

    • Paid Volunteer Time Off (VTO)

    • Expansion of Sonatype’s India Innovation Hub in Hyderabad, reflecting our continued growth, commitment to innovation, and investment in talent to advance AI-driven software security globally

    At Sonatype, we value diversity and inclusivity. We offer perks such as parental leave, diversity and inclusion working groups, and flexible working practices to allow our employees to show up as their whole selves. We are an equal-opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you have a disability or special need that requires accommodation, please do not hesitate to let us know.



    Sonatype provides comprehensive software supply chain security solutions, empowering over 2,000 organizations to create and maintain secure software. Our distinctive approach combines proactive protection against malicious open source, enterprise-grade SBOM management, and a leading dependency management platform, making us a trusted partner for enterprises looking to innovate securely and efficiently.

    Founded
    Founded 2008
    Employees
    201-500 employees
    Industry
    Internet Software & Services
    Total raised
    $150M raised
    View company profile
    Report this job
    Apply for this job