Data Engineer

AI overview

Design and maintain scalable data pipelines and distributed systems using Python, PySpark, and AWS, while leveraging strong data modeling and ETL framework experience.

Job Description

We are seeking an experienced Data Engineer to design, build, and maintain scalable data pipelines and distributed data systems. The ideal candidate will have strong expertise in Python, PySpark, and AWS-based data platforms, along with solid experience in data modeling and ETL frameworks.

Location: Columbus, OH
Work Mode: Onsite
Employment Type: Contract

Preferred / Nice to Have

  • Experience with Airflow or other workflow orchestration tools

  • Knowledge of Kafka, Kinesis, or streaming data platforms

  • Experience with Docker/Kubernetes

  • Exposure to Delta Lake, Iceberg, or Hudi

Requirements

Required Skills & Qualifications

  • Strong proficiency in Python for data processing and pipeline development

  • Hands-on experience with Apache Spark (PySpark preferred)

  • Solid experience with AWS services such as S3, Glue, EMR, Redshift, Athena, and Lambda

  • Strong experience with SQL and relational/non-relational databases

  • Knowledge of data modeling, data warehousing concepts, and ETL frameworks

  • Experience working with large-scale distributed data systems

  • Familiarity with CI/CD pipelines and Git

  • Strong analytical, problem-solving, and communication skills

Hudson Manpower Inc. specializes in recruitment and project staffing for the Oil, Gas, Petrochemical, Power Plant, and Construction industries, supplying quality manpower to reduce training costs and bridge the gap between employers and employees.

View all jobs
Ace your job interview

Understand the required skills and qualifications, anticipate the questions you may be asked, and study well-prepared answers using our sample responses.

Data Engineer Q&A's
Report this job
Apply for this job