About the Role
Cyberhaven’s lineage technology tracks billions of data events. Your mission is to build the service layer that wraps this data in an agentic framework. You will create tools that allow our "Professional Services" to scale by deploying fleets of AI agents that can:
Conduct Autonomous Audits: Automatically map a customer's data flow and identify "Shadow AI" or high-risk data egress points.
Generate Intelligence-Driven Policies: Move beyond static templates to provide custom, context-aware security recommendations.
Diagnose at Lineage-Speed: Build "Diagnostic Agents" that can trace a single file's journey through months of history to explain a breach in seconds.
What You’ll Do
Build the "Service-as-a-Software" Layer: Develop the backend services that enable AI agents to perform complex, multi-step tasks traditionally handled by security consultants.
AI Agents: Implement AI agents that use product data to build tools and generate recommendations.
Data Integration (BigQuery): Write high-performance SQL and data-extraction logic in BigQuery to feed agents the rich, structured context they need from our massive lineage datasets.
API-First Integration: Design and consume robust RESTful APIs that connect our core platform to LLM providers (Glean) and downstream customer workflows.
Automated Diagnostics: Create the logic for "Reasoning Loops" where agents can query BigQuery, interpret security logs, and output a human-readable diagnosis of complex data security issues.
Who You Are
1. Scripting & Agentic Development
Advanced Python/Go: Deep experience building backend services in Python or Go.
Agent Frameworks: Familiarity with building "Agentic" workflows (handling state, long-term memory, and tool-use) rather than just simple "Chat" interfaces.
2. Big Data & Analytics (BigQuery)
Complex SQL: Ability to join and analyze high-cardinality datasets. You should be comfortable optimizing queries that touch billions of rows of lineage data.
Data Modeling: Experience structuring data specifically for LLM consumption (e.g., creating efficient RAG embeddings or JSON-based context windows).
3. API & System Integration
RESTful Architecture: Experience using REST APIs to build agentic services.
Integration Ecosystems: Familiarity with integrating security tools (SIEM/SOAR) via APIs to close the loop from "Recommendation" to "Enforcement."
4. Leveraging AI via APIs
LLM Orchestration: Mastery of leveraging LLMs via API, including advanced prompt engineering, structured output (JSON mode), and managing token/latency trade-offs.
Guardrails & Validation: Experience building "verification" layers where a second agent or a deterministic script validates the primary agent's recommendations before they are presented to a customer.
Joining Cyberhaven is a chance to revolutionize data security. Traditional tools fall short, but we’ve reimagined protection with AI-enabled data lineage that analyzes billions of workflows to understand data, detect risk, and stop threats. Backed by $250M from leading investors like Khosla and Redpoint, our team includes leaders who built industry-defining technologies at CrowdStrike, Palo Alto Networks, Meta, Google, and more. This role lets you shape the future of data security, alongside experts driven to help customers protect their most valuable information.
Cyberhaven is committed to creating a diverse environment and is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.