Description

You will lead the technical strategy, architecture, and delivery planning for a large-scale data modernization initiative.

Responsibilities

  • Own architecture design for AWS EMR, S3 (Iceberg), MWAA, and Athena-based Data Lake platforms.
  • Guide data migration from Cloudera (HDFS, HiveQL, PySpark) to AWS S3 and Iceberg.
  • Define EMR cluster sizing, cost optimization strategies, and instance usage.
  • Lead design decisions regarding schema evolution, partitioning, and time travel in Iceberg.
  • Collaborate with stakeholders to manage project scope and technical deliverables.

Required Skills

  • 10+ years of experience in cloud and big data architecture.
  • Expertise in AWS EMR, S3, Iceberg, Redshift, Athena, Glue, and MWAA.
  • Strong understanding of Spark internals, PySpark, HiveQL, and HDFS architecture.
  • Proven experience executing large-scale data lake migrations.
  • Ability to provide hands-on oversight to onsite and offshore engineering teams.
  • Any Graduate degree.

Education

Any Graduate