Develop and manage data processing workflows within the Hadoop ecosystem.
Responsibilities
- Build and maintain data pipelines using Sqoop, Hive, and HBase.
- Develop distributed processing applications with Spark, Python, and Scala.
- Manage workflow orchestration using Oozie.
- Plan and organize technical deliverables while adhering to software development standards.
- Collaborate with cross-functional teams and stakeholders to meet project requirements.
Required Skills
- 4+ years of experience in Big Data development.
- Hands-on experience with Sqoop, Hive, HBase, and Spark.
- Proficiency in Python and Scala.
- Experience with Oozie for workflow management.
- Strong SQL skills.
- Proficiency in Unix basic scripting environments.
- Bachelor's degree in a relevant field.
- Ability to prioritize and manage multiple work streams independently.