CV Submission until 26.07.2024
Service delivery: Near Site, Belgium - The services shall be performed remotely in Near-site location allowing to reach the Commission premises in Brussels within 2 hours.
VASS, in partnership with the European Commission, is currently seeking an Data Scientist - AI/ML/NLP to work for the European Commission in Belgium
DESCRIPTION OF THE TASKS
- Development and maintenance of software applications in the field of Natural Language Processing (NLP), Machine Learning (ML) and/or Artificial Intelligence (AI);
- Training of custom machine learning / deep learning models based on structured and unstructured data;
- Selecting features, building and optimizing classifiers using machine learning techniques;
- Studies and developments aiming at improving the quality of machine translation (MT) engines for each installed language pair, addressing the specific needs of customers of the service concerning MT quality and contributing to a general strategy for the systematic evaluation and long-term improvement of MT quality;
- Interact with data stewards and other IT stakeholders to define the data rules;
- Define data controls and implement actions to ensure data quality and integrity;
- Creating automated anomaly detection systems and constant tracking of its performance;
- Data mining using state-of-the-art methods;
- Processing, cleansing, and verifying the integrity of data used for analysis;
- Design the IT architecture for solitons in the NLP / ML / AI fields, and coordinate its implementation considering master- and meta-data management concepts;
- Analyse data architecture for consistency, completeness, accuracy and reasonableness;
- Contributing for the analysis of data management vision, strategy and policy and derive the IT requirements; Contributing to the design of the IT architecture considering master- and meta-data management concepts; Elaboration of test programs;
- Writing of technical documentation;
- Assistance with deployment and configuration of the system;
- Participation in meetings with the contracting authority, project teams and user groups.
JOB REQUIREMENTS
-
Master's Degree in Computer Science or related field
-
Extensive hands-on experience with the design and implementation of Retrieval Augmented Generation (RAG).
- Very good knowledge of programming in Python (programming in R is considered an asset).
- Very good experience with libraries and frameworks like TensorFlow, Hugging Face, PyTorch, scikit-learn, or Keras.
- Very good knowledge with using cloud platforms like AWS, Azure, or Google Cloud for designing/implementing scalable AI/ML/RAG/etc. solutions.
- Extensive hands-on experience with various machine learning algorithms, including supervised, unsupervised, and reinforcement learning.
- Good knowledge of deep learning and neural networks.
- Knowledge of the involved Mathematics and Statistics (Linear algebra, Calculus, Probability theory Statistics, Optimization techniques, etc.).
- Good knowledge of data pre-processing and analysis techniques.
- Good experience with data visualization using tools like Matplotlib, Seaborn or Plotly.
- Good experience with RDBMS like Oracle (latest version).
- Good experience of NoSQL databases like MongoDB or Cassandra.
- Experience in training deep learning models.
- Good knowledge of NLP techniques and libraries like NLTK, spaCy, or Transformers.
- Extensive experience with model evaluation and validation techniques.
- Extensive experience with deploying ML models to production.
- Good knowledge of potential bias and fairness issues in AI systems.
- Knowledge of the applicable data privacy regulations (GDPR) and compliance with AI/ML applications will be considered as an asset.
- Familiarity with reinforcement learning algorithms and environments.
- Familiarity with distributed computing frameworks such as Apache Spark or Hadoop for handling large dataset will be considered an asset.
- Ability to give business and technical presentations.
- Ability to apply high quality standards.
- Ability to cope with fast changing technologies used in the domain of Artificial Intelligence.
- Good communication skills with technical and non-technical audiences.
- Analysis and problem-solving skills.
- Capability to write clear and structured technical documents.
- Ability to participate in technical meetings and good communication skills.
- Very good knowledge of English (both written and oral).
WHO ARE WE?
VASS (https://vasscompany.com/en/) is a leading digital solutions group of companies headquartered in Madrid, Spain, present in 26 countries in Europe, the Americas and Asia with more than 4,700 professionals
VASS helps large companies in their digital transformation process, developing and executing the most innovative and scalable projects, from strategy to operations.
All our growth comes from our talented people, passion for innovation, and a constant search for improvement, always the VASS way: “Complex made simple”.