You will develop solutions using Python within a Big Data environment.
Responsibilities
- Process large datasets, performing joins, merges, transformations, and summaries.
- Design and implement data control checks to ensure data integrity.
- Interact with data stores using SQL.
- Develop and consume APIs.
Required Skills
- 5+ years of professional experience.
- Python development proficiency.
- Experience with Spark, Hive, and HDFS.
- Familiarity with Big Data platforms such as Hortonworks.
- Proficiency in SQL for querying data stores.
- Experience working in a Linux environment.
- Knowledge of Scala and Java.
- Experience with RDS data processing.