We are looking for a Data Engineer with strong Python skills and a focus on data analysis. In this role, you will work closely with cross-functional teams to describe and document data flows and pipelines, format data, and inform data structures. You will also analyze business requirements and translate them into project requirements, ensuring the effective implementation of data solutions.
Key Responsibilities:
- Describe and document data flows and data pipelines
- Format data and describe data structures
- Analyze business requirements and translate them into project requirements
- Develop and maintain data pipelines using Python
- Collaborate with data scientists, engineers, and analysts to ensure data solutions meet business needs
- Optimize data pipelines for performance and scalability
- Troubleshoot and resolve data-related issues
- Document findings, results, and system improvements
Must-Have:
- Minimum 4 years of development experience, with strong proficiency in Python
- Good knowledge of Python for scripting and development
- Excellent analytical and problem-solving skills
- Good communication skills with fluency in English
- Experience in describing and documenting data flows and data pipelines
- Ability to analyze business requirements and translate them into project requirements
Nice-to-Have:
- Familiarity with data pipeline construction and optimization
- Knowledge of data visualization tools and techniques
- Experience with machine learning frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn)
- Understanding of data structures and data formatting
- Hands-on experience with running experiments and documenting results
- Exposure to deep learning concepts and applications
- Experience with Java or R programming