The goal is to create an efficient system for collecting, processing, analyzing, and visualizing large amounts of data from various sources
- Lay down the foundation and model the data
- Develop a platform for building and maintaining data pipelines that collect data from different sources
- Create Data Warehouse
- Analyze and prepare domain descriptions in collaboration with Business analytics
- Create Data Marts
- Design a permission model with flexible control over vertical and horizontal access to data
- Conceptual knowledge of data analytics fundamentals, e.g., dimensional modeling, ETL/ELT, reporting tools, data governance, data warehousing, structured and unstructured data
- Strong SQL knowledge and experience with RDBMS, confident knowledge of database fundamentals
- Experience in database development and data modeling, ideally with Databricks/Spark
- Experience with Python
- Experience with Azure
- Working knowledge of serialization formats and their trade-offs (columnar vs. row-based)
- Experience debugging and optimizing Spark jobs
- Strong written and verbal communication skills
- At least an Upper-Intermediate level of English
- BS in Computer Science or a related field
WOULD BE A PLUS
- Experience with a Business Intelligence tool