What is this position about?
We are looking for a Senior Data Engineer to join a high-impact Customer Insights product engagement for a global QSR client. This role is hands-on-keyboard and focused on building enterprise-grade data pipelines that power a unified Analytics ID, modular data marts, and a scalable feature store. The ideal candidate brings deep expertise in Databricks, Spark (PySpark), SQL, and large-scale identity resolution pipelines, and thrives in complex, production-ready data environments.
Job Description
Design and build production-grade data pipelines in Databricks using Spark/PySpark and SQL.
Develop and maintain an Analytics ID stitching pipeline using deterministic and probabilistic matching techniques across multiple customer data sources.
Build and manage modular data marts (Identity, Behavior, Demographics) with independent refresh cadences.
Implement and maintain a scalable feature store supporting downstream analytics and data science use cases.
Own the end-to-end data lifecycle: ingestion, transformation, validation, deployment, monitoring, and optimization.
Develop data quality frameworks including schema drift detection, anomaly monitoring, match-rate validation, and automated deduplication audits.
Implement CI/CD processes for multi-environment promotion (dev/staging/prod) in Databricks environments.
Coordinate orchestration workflows and manage dependencies using Databricks Workflows or similar tools.
Collaborate closely with Data Architects and Client stakeholders to translate business rules into scalable technical solutions.
Produce comprehensive technical documentation including data contracts, lineage maps, architecture diagrams, and operational runbooks.
5+ years of experience in Data Engineering building production-grade data pipelines at scale.
Strong hands-on experience with Databricks and Apache Spark (PySpark preferred).
Advanced SQL skills (complex joins, CTEs, window functions, performance tuning).
Experience developing identity resolution or entity matching pipelines (deterministic and/or probabilistic).
Experience designing and implementing data marts or dimensional models (Kimball or similar).
Familiarity with data quality frameworks (schema drift detection, validation, anomaly monitoring).
Experience implementing CI/CD for data pipelines and managing multi-environment deployments.
Strong communication skills and ability to present technical concepts to non-technical stakeholders.
Experience using Jira for ticket tracking and Confluence for documentation.
Nice to Have:
Experience with third-party data providers (Epsilon, LiveRamp, Neustar).
Experience with feature stores (Databricks Feature Store, Feast, or similar).
Knowledge of Databricks Unity Catalog.
Experience managing large-scale customer data (transactions, loyalty, retail/QSR data).
Experience with Delta Lake / Lakehouse architecture.
Familiarity with orchestration tools such as Airflow.
Experience working in consulting or embedded enterprise client environments.
What about languages?
Advanced English level (written and spoken) required for client-facing collaboration and technical presentations.
How much experience must I have?
Minimum of 5 years of professional experience in Data Engineering roles working with large-scale distributed data systems.
Our Perks and Benefits:
📚Learning Opportunities:
👨🏽💻Travel opportunities to attend industry conferences and meet clients.
👩🏫 Mentoring and Development:
🎁 Celebrations & Support:
⚖️ Flexible working options to help you strike the right balance.
Other benefits may vary according to your location in LATAM. For detailed information regarding the benefits applicable to your specific location, please consult with one of our recruiters.
So what are the next steps?
Our team is eager to learn about you! Send us your resume or LinkedIn profile below and we’ll explore working together!
Please mention you found this job on AI Jobs. It helps us get more startups to hire on our site. Thanks and good luck!
Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.
Senior Data Engineer Q&A's