Description
You will build and manage scalable data pipelines and integration layers within an AWS environment.
Responsibilities
- Design and develop data pipelines using AWS Glue, Spark, and Python with Airflow.
- Implement data modeling strategies across Master, Reference, ODS, DW, and DM layers for platform analytics.
- Develop Data APIs using Python, Flask, or FastAPI to expose platform data.
- Build event and message-based inbound and outbound integrations.
- Execute data governance, lifecycle management, and data quality processes using industry-standard catalog tools.
Required Skills
- 5+ years of experience in data engineering.
- Strong proficiency in Python, Spark, and AWS Glue.
- Deep understanding of AWS services from a data engineering perspective.
- Experience with advanced data modeling for analytics platforms.
- Ability to design APIs supporting polyglot persistence, including Document and Graph stores.
- Experience with event-driven integrations and messaging.
- Knowledge of data quality and data cataloging tools and processes.
Preferred Skills
- Experience with Informatica Cloud.
- Deep experience with cloud platforms other than AWS.