About Us
Socket helps devs and security teams ship faster by cutting out security busywork. Thousands of orgs use Socket to safely find, audit, and manage open source code. Our customers — from Anthropic to xAI, and Figma to Vercel — love Socket (just check out their tweets to see for yourself!)
Founded by Feross Aboukhadijeh, a long-time open source maintainer with software downloaded over a billion times a month, Socket has raised $65M in funding from top angels, operators, and security leaders.
About the Role
We're looking for a Data Engineer to join our Data Platform team and build the infrastructure that powers Socket's data ecosystem. You'll design and maintain systems that handle billions of records, enable real-time analytics, and power the insights our customers rely on to secure their software supply chain.
This is a high-impact role where you'll work across the stack- from ingestion pipelines to analytics APIs - ensuring data flows reliably and is accessible when teams need it.
What You'll Do
Design and build scalable data pipelines that ingest, process, and transform high-volume event streams and historical data
Develop and maintain APIs that deliver analytics, trend reports, and drill-down capabilities to internal teams and external customers
Build robust infrastructure for data quality monitoring, ensuring accuracy and completeness across customer and artifact datasets
Optimize data storage and query performance using systems like ClickHouse, Kafka, NATS, and PostgreSQL to support real-time and batch use cases
Implement usage tracking, auditing, and event processing systems that provide visibility into platform behavior
Create reliable data ingestion systems for security scan results, SBOM data, and artifact metadata
Build infrastructure for outbound integrations that deliver Socket data to customer systems
Collaborate with product, security research, and engineering teams to understand data needs and deliver solutions that scale
What You'll Bring
3+ years of experience building data pipelines and infrastructure in production environments
Strong proficiency in Node.js and TypeScript for backend development
Experience with streaming data platforms (e.g., Kafka, NATS, RabbitMQ, Kinesis) and event-driven architectures
Hands-on experience with columnar databases and OLAP systems (ClickHouse, BigQuery, Snowflake, DuckDB, or similar)
Solid understanding of data modeling, schema design, and query optimization
Familiarity with cloud data lake technologies (Delta Lake, Iceberg, Parquet, etc.)
Experience building REST APIs and data access layers for analytics use cases
Comfort working with large-scale distributed systems and debugging performance bottlenecks
Strong ownership mindset - you take responsibility for the systems you build and ensure they're reliable
Clear communication skills; you can explain technical trade-offs to both engineers and non-technical stakeholders
Nice to Have
Experience with time-series data and real-time analytics
Familiarity with security or DevOps tooling ecosystems
Background working with SBOM formats or supply chain security concepts
Experience with data quality frameworks and observability tools
Understanding of multi-tenant architectures and data isolation patterns
We know how important clarity is when looking for a new role, so we've put together a read-me about the Interview Process at Socket.
Benefits: Our benefits are crafted to support you and your family, so you can take care of what matters most and thrive in and outside of work. We offer:
Market competitive salary bands
Meaningful equity program
Comprehensive health benefits for you and your family
Flexible time-off, holidays, and winter shutdown to rest & recharge
Paid parental leave
Remote-first, with quarterly team off-sites
At Socket, we
Pursue Excellence: We set ourselves apart by consistently delivering work of exceptional quality and distinction.
Move with urgency and focus: We prioritize swift, decisive action.
Think rigorously: We care about being right and it often takes reasoning from first principles to get there. We value alternative perspectives and have constructive discussions.
Trust and amplify: We overtrust, always assume good intent, and give specific feedback to help each other improve.
Feel a strong sense of ownership: We wear many hats and feel a strong sense of overall ownership of the company and we're non-territorial regarding our nominal domains.
Are customer obsessed: We relentlessly prioritize the needs of our customers, striving to exceed their expectations and delight them at every interaction.
Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!
Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.
Senior Data Engineer Q&A's