Staff Engineer (Data Engineer – AI & Digital Platforms)

AI overview

Design and develop scalable data pipelines and full stack applications, leveraging advanced technologies for intelligent data interaction and machine learning operationalization.

Data Engineer – AI & Digital Platforms

Must-Have Skills

  • Hadoop and MapReduce
  • Cloudera
  • AI-enabled Application Development
  • Machine Learning – General Experience
  • LLM Application Frameworks (Capable)

Key Responsibilities

  • Design and develop scalable data pipelines across Hadoop (Hive, Impala, Spark, Kafka, Iceberg) and Teradata environments.
  • Build ingestion and transformation frameworks using Java, Spark, Python, and Shell scripts.
  • Develop full stack applications and internal tools using Python, Shell scripting, and modern web frameworks (Flask, React).
  • Create APIs and microservices to expose data and ML models securely to downstream systems and user interfaces.
  • Collaborate with data scientists to operationalize ML models using Cloudera Machine Learning (CML).
  • Build and deploy GenAI/LLM-powered applications for intelligent data interaction, summarization, and automation.
  • Implement enterprise-grade security controls including RBAC, LDAP, Kerberos, Apache Ranger, and row-level access.
  • Tune and optimize data applications for performance across Hadoop and Teradata, ensuring efficient resource utilization
    • Support sandbox environments for prototyping, enabling users to build ML models, dashboards, and data pipelines.

    Required Skills & Experience

    Data Engineering

    • Strong experience with Hadoop ecosystem (Hive, Impala, Spark, Kafka, Iceberg, Ranger, Atlas), Teradata, and data pipeline orchestration.
    • Experience with MPP databases (e.g., Trino, Presto).
    • Proven ability in development and performance tuning of large-scale data applications.

 

Full Stack Development

  • Proficiency in Python, Shell scripting, REST APIs, and web frameworks (Flask, React).

Machine Learning & AI

  • Hands-on experience with ML platforms (CML), Spark MLlib, and Python ML libraries (scikit-learn, XGBoost).
  • Experience in operationalizing ML models at enterprise scale.

GenAI/LLM Applications

  • Familiarity with building applications using large language models (OpenAI, Hugging Face, LangChain).
  • Ability to build agent workflows and support users in creating agent-based solutions.

Security & Governance

  • Experience with enterprise data security (LDAP, Kerberos, RBAC), data masking, and access control.

Performance Tuning

  • Strong expertise in optimizing data applications and queries in Hadoop and Teradata environments.

Tools & Platforms

  • Cloudera Data Platform (CDP), Informatica, QlikSense, Apache Oozie, Git, CI/CD pipelines.

 

Soft Skills

  • Strong analytical and problem-solving skills.
  • Excellent communication abilities.
  • Ability to work effectively in cross-functional teams

👋🏼 We're Nagarro.We are a digital product engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (19,500+ experts across 36 countries, to be exact). Our work culture is dynamic and non-hierarchical. We're looking for great new colleagues. That's where you come in!By this point in your career, it is not just about the tech you know or how well you can code. It is about what more you want to do with that knowledge. Can you help your teammates proceed in the right direction? Can you tackle the challenges our clients face while always looking to take our solutions one step further to succeed at an even higher level? Yes? You may be ready to join us.

View all jobs
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Staff Engineer Q&A's
Report this job
Apply for this job