Bosta
Senior Data Engineer
TLDR
Help rebuild Bosta’s data platform from end-to-end leveraging technologies like MongoDB and dbt, while owning the foundational code across the stack for data consumers.
About Bosta:
Launched in 2017, Bosta is an overnight delivery provider for E-commerce companies in Egypt. Leveraging technology to empower businesses is what we do best. Our goal is to disrupt the logistics industry by bringing technology and experience all together to provide a one-stop-shop for delivery solutions.
We rely on advanced technology to deliver orders to our customers in a fast way. Businesses can track and connect with couriers and follow up on their orders - all in one system.
About the role:
We’re rebuilding Bosta’s data platform end-to-end: from MongoDB at the source, through a governed semantic layer that LLM-native tools (NL-to-SQL agents, AI analysts, embedded copilots) can sit on top of safely and cheaply. You will help own that rebuild.This is a hands-on role. You’ll set patterns, write the foundational code, and ship across the stack — from CDC and ingestion through dbt, the semantic layer, and the interfaces that BI tools and AI agents consume.
Job Responsibilities
End-to-end pipeline work: MongoDB CDC → ingestion → lakehouse → warehouse → dbt → semantic layer → BI/AI consumers
Co-ownership of architecture decisions with the Data Engineering Lead
CDC from production MongoDB without degrading operational DB performance; ingestion patterns that make adding a new source a config change, not a project
Orchestration that’s observable end-to-end (Airflow, Dagster, or Prefect)
The dbt project: structure, conventions, tests, contracts, exposures, CI
The semantic / metrics layer (dbt Semantic Layer, Cube, or equivalent) — one canonical definition per business metric
LLM-readiness: column-level documentation, PII tagging, query cost guardrails, materialized metric tables, and evals on AI-generated SQL
Migration of existing logic out of the Tableau and Metabase sprawl into modeled, governed sources
Job Qualifications
4+ years across data engineering and/or analytics engineering — you’ve spent meaningful time on both sides
Comfort spanning the stack: comfortable shipping a Debezium connector one week and a dbt mart the next
Deep dbt and SQL — you’ve owned a non-trivial project, not just contributed to one
Production CDC experience (Debezium, Kafka Connect, Airbyte, or hand-rolled) against operational databases — bonus if that database was MongoDB
A cloud warehouse you know deeply (Redshift, Snowflake, BigQuery, or Databricks)
Strong Python; comfortable in Linux, infra-as-code, and CI/CD
Working understanding of how LLM tooling (RAG, NL-to-SQL, embedded agents) consumes a data platform — and what breaks when the platform isn’t ready
Strong opinions on modeling, lightly held; bias toward observability
MongoDB schema evolution at scale
Production semantic-layer rollouts (dbt Semantic Layer, Cube, LookML, MetricFlow)
Lakehouse formats (Iceberg, Delta, Hudi) or streaming experience (Kafka, Flink, Kinesis)
Data catalog / lineage tooling (DataHub, Atlan, Collibra)
Logistics, marketplaces, or other operationally heavy domains
Evals on AI-generated SQL or analytical reasoning
Bonus:
Thank you for considering Bosta as your next career step. We value every application we receive, and our selection process is designed to ensure the best fit for both our team and our candidates. We appreciate your patience throughout the process
Bosta is an overnight delivery provider focused on E-commerce companies in Egypt and KSA. By leveraging technology, we deliver fast, reliable logistics solutions that enable businesses to track and connect with couriers seamlessly through a unified platform, effectively disrupting traditional logistics models.
- Founded
- Founded 2016
- Employees
- 51-200 employees
- Industry
- Internet Software & Services
- Total raised
- $1.7M raised
Senior Data Engineer