Voxel

Senior Software Engineer, ML Infrastructure

$200,000 – $240,000 per year

TLDR

Own the ML Infrastructure for training and deploying vision models at scale, and establish best practices for how the perception team trains, tracks, and ships models.

Who We Are

Voxel is building the future of Computer Vision and Machine Learning for operations, risk, and safety. We use computer vision and AI to enable existing security cameras to automatically detect hazards and high-risk activities, keep people safe, and drive operational efficiencies. Our technology addresses the key cost drivers for workers’ compensation, general liability, and property damage, which cost US employers over $500 billion annually. Our customers include Fortune 500 companies across grocery, retail, manufacturing, food and beverage, logistics, and pharmaceutical distribution. We’ve passed $10M ARR with strong expansion revenue. We are based in SF and backed by industry-leading VCs.

About the Role

Voxel’s perception system is the technical core of everything we ship. Our models detect human activity, equipment interactions, environmental hazards, and operational state in real time across thousands of cameras in manufacturing, logistics, retail, and pharmaceutical environments. Safety was our wedge; it proved our platform works. Now customers are pulling us into operations: equipment utilization, workflow compliance, process efficiency. Every new use case runs through the perception team.

We're hiring a strong software engineer to own the ML Infrastructure that powers how Voxel trains and ships vision models. You’ll build systems that let our applied ML team train multiple models concurrently, manage experiments, and ship optimized models to production. You'll set technical direction, write code, make architecture calls, and partner closely with applied CV, ML Data, and Platform engineers.

What You'll Do

  • Build and maintain training infrastructure that lets the applied ML team train multiple models concurrently, manage experiments, and iterate quickly on new architectures.

  • Own the train-to-deploy handoff: export trained models to optimized inference formats (TensorRT, ONNX), quantify accuracy and latency impact, and partner with Platform on production deployment.

  • Establish ML experiment tracking and lifecycle management: pick the right tools (Weights & Biases, MLflow, ClearML, or similar) so researchers can run, compare, and reproduce experiments efficiently.

  • Establish DevOps-for-ML best practices on AWS (IaC, CI/CD, observability, cost monitoring) so researchers can iterate quickly and safely.

  • Understand the infra needs of applied ML/CV engineers and design scalable solutions that support model development.

What We're Looking For

  • 4+ years of experience building and shipping large-scale software solutions.

  • Hands-on experience building ML training pipelines in PyTorch.

  • Hands-on experience with ML experiment tracking and lifecycle tools (Weights & Biases, MLflow, ClearML, or similar).

  • Experience with AWS (S3, EC2, EKS, or similar) for ML workloads.

  • Strong Python skills, with the ability to write performant code that scales well in production environments.

  • Track record of owning infrastructure end-to-end: scoping, building, shipping, and improving systems that internal teams depend on.

  • Bias toward shipping. You'd rather ship something good this week than something perfect next quarter.

  • Strong communication skills.

Nice to Have

  • Experience with modern ML orchestration tools (Ray, Sematic, Flyte, Metaflow, Prefect, or similar)

  • Familiarity with GPU performance profiling and optimization (Nsight, PyTorch profiler, or similar)

  • Background in computer vision model training

Compensation & Benefits

  • Equity through Voxel’s Equity Incentive Plan

  • Total compensation includes base salary, annual bonus, and equity

  • Comprehensive health, dental, and vision insurance

  • Competitive paid parental leave

  • Unlimited PTO and flexible work arrangements

  • Daily meals in-office, team events, annual company onsite

Voxel leverages computer vision and AI to transform existing security cameras into proactive safety tools, enabling them to automatically detect hazards and high-risk activities. Targeting Fortune 500 companies, our platform enhances operational efficiencies and addresses significant risk and cost factors in workplace safety.

Founded: 2020
Employees: 51-200
Industry: Internet Software & Services
Total raised: $30M