Software Engineer, Web Crawling
TLDR
Contribute to a massive-scale web crawler and ML-driven search engine, working on distributed crawling, performance optimization, and dynamic content handling at scale.
Exa is an applied AI lab building a search engine unlike any the world has ever seen. We build massive-scale infrastructure to crawl the entire web, train state-of-the-art embedding models to process it, and design high-performance vector databases to retrieve over it. We now power search for Cursor, Cognition, HubSpot, and over 400,000 developers, and have raised $350m from Lightspeed, Benchmark, and a16z.
Our ultimate goal is to build perfect search over all the world's information, far beyond Google. If you want to build massive-scale ML systems that will define the way the new AI world consumes information, this is the place for you.
As a web crawling engineer, you'd be responsible for crawling the entire web. In short: build Google-scale crawling!
Who You Are
You have extensive experience building and scaling web crawlers, or would be excited to ramp up very quickly
You have experience with a high-performance language (C++, Rust, etc.)
You are familiar with TypeScript, Playwright, modern web design, and the Chrome DevTools Protocol (CDP)
You’re comfortable optimizing a system to an exceptional degree
You care about the problem of finding high quality knowledge and recognize how important this is for the world
What You Could Do
Build a distributed crawler that can handle 100M+ pages per day
Optimize crawl politeness and rate limiting across thousands of domains
Design systems to detect and handle dynamic content, JavaScript rendering, and anti-bot measures
Create intelligent crawl scheduling and prioritization algorithms for maximum coverage efficiency
Logistics
Location: This is an in-person opportunity in Singapore.
Visas: We’re happy to sponsor international candidates! While we cannot guarantee your visa, we have historically been successful in sponsoring candidates from all over the world. If you receive an offer, our team will work hard to get you a visa.
Benefits: We offer premium healthcare benefits (medical, dental, vision), fertility benefits, 16 weeks of fully paid parental leave for all new parents, and a monthly wellness stipend to all of our employees.
Exa develops a real-time AI search engine and web crawling API, tailored for users seeking to extract structured data from websites. Our platform is uniquely built from the ground up to support every AI application, leveraging high-performance infrastructure and advanced embedding models to push the limits of web search capabilities.
- Founded: 2016
- Employees: 1-10
- Industry: Internet Software & Services