Description
You will design and manage data pipelines within an AWS ecosystem.
Responsibilities
- Build and maintain ETL processes using AWS Glue and PySpark.
- Manage data workflows across S3, EMR, and Athena.
- Write and optimize complex SQL queries for database management.
- Implement deployment workflows using Git and Bamboo CI/CD tools.
Required Skills
- 6+ years of experience in data engineering.
- Proficiency in Python and PySpark.
- Hands-on experience with AWS Glue and AWS ETL workflows.
- Strong command of SQL and database management principles.
- Experience with AWS services including S3, EMR, and Athena.
- Experience using Git and Bamboo for CI/CD.
- Any Graduate degree.
Preferred Skills
- Experience with Redshift, Aurora, or DynamoDB.
- Background working in the Financial Services industry.