You will build and maintain data pipelines within the Hadoop ecosystem, owning technical architecture decisions and sprint delivery.
Responsibilities
- Execute sprint stories and deliver tasks within defined timelines and quality standards.
- Define and discuss technical architectures, evaluating pros and cons with the team.
- Participate in brainstorming sessions to suggest improvements to system design.
- Collaborate with team leads and US-based counterparts to review architecture.
- Communicate project status, risks, and issues to all stakeholders.
Required Skills
- 4.5 to 6 years of experience in Data Engineering development.
- Hands-on experience with the Hadoop ecosystem, including HDFS, Hive, and Yarn.
- Strong knowledge of Apache Spark and PySpark.
- Proficiency in Python programming.
- Experience working with Hadoop and Hive.
- Advanced SQL skills.
- Experience with file formats such as Avro and Parquet.
Preferred Skills
- Bachelor's degree in Computer Science, Software Engineering, IT, or a related field.