The advancement and placement team has a very important role throughout Syngenta. When developing new varieties, the amount of data collected is quite big and has to be managed properly. Collaborating with the data scientists in the team as well as the data owners, you will play a key role in facilitating the data processing and ensuring that the infrastructures and solutions are fulfilling all of the needs of the users.
Responsibilities
- Build up data model and data pipeline to enable automated analysis.
- Conduct QC into the data process
- Implement unit tests
- Enhance existing infrastructures and propose new solutions if needed
- Cooperate with different actors with multiple backgrounds
- Currently enrolled in a Bachelor's or Master's degree program in Computer Science/ Data Engineering/Big Data/Data Science
- With data engineering background, can build the backbone of data infrastructure and contribute to acquiring, transforming, and cleaning data at scale.
- Ability to build, test, and maintain robust database pipeline architectures
- Ability to Integrate various data sources (databases, APIs, streams)
- Able to develop and optimize data processing jobs using AWS services or similar solutions
- Ability to collaborate and understand the needs of data owners as well translating it to simple views
- Strong knowledge in Python and especially pandas, numpy, pytest,
- Some experience with application development is preferred
- Familiarity with version control systems (Git) and code repositories (GitHub, GitLab)
- Understanding of data modeling, ETL processes, and data quality principles
- Excellent problem-solving and debugging skills