Description

You will build and maintain data pipelines within the Hadoop ecosystem, owning technical architecture decisions and sprint delivery.

Responsibilities

  • Execute sprint stories and deliver tasks within defined timelines and quality standards.
  • Define and discuss technical architectures, evaluating pros and cons with the team.
  • Participate in brainstorming sessions to suggest improvements to system design.
  • Collaborate with team leads and US-based counterparts to review architecture.
  • Communicate project status, risks, and issues to all stakeholders.

Required Skills

  • 4.5 to 6 years of data engineering development experience.
  • Hands-on experience with the Hadoop ecosystem, including HDFS, Hive, and YARN.
  • Strong knowledge of Apache Spark and PySpark.
  • Proficiency in Python programming.
  • Advanced SQL skills.
  • Experience with file formats such as Avro and Parquet.

Preferred Skills

  • Bachelor's degree in Computer Science, Software Engineering, IT, or a related field.

Education

Bachelor's degree