You will lead the technical strategy, architecture, and delivery planning for a large-scale data modernization initiative.
Responsibilities
- Own architecture design for AWS EMR, S3 (Iceberg), MWAA, and Athena-based Data Lake platforms.
- Guide data migration from Cloudera (HDFS, HiveQL, PySpark) to AWS S3 and Iceberg.
- Define EMR cluster sizing, cost optimization strategies, and instance usage.
- Lead design decisions regarding schema evolution, partitioning, and time travel in Iceberg.
- Collaborate with stakeholders to manage project scope and technical deliverables.
Required Skills
- 10+ years of experience in cloud and big data architecture.
- Expertise in AWS EMR, S3, Iceberg, Redshift, Athena, Glue, and MWAA.
- Strong understanding of Spark internals, PySpark, HiveQL, and HDFS architecture.
- Proven experience executing large-scale data lake migrations.
- Ability to provide hands-on oversight to onsite and offshore engineering teams.
- Any Graduate degree.