You will build and manage data pipelines and cloud infrastructure within the AWS ecosystem.
Responsibilities
- Create and manage cloud resources in AWS to support data ingestion from RDBMS, REST HTTP APIs, flat files, streams, and time-series sources.
- Implement data processing and transformation workflows using Spark and cloud services.
- Develop automated data quality checks to ensure accuracy and verify calculation results.
- Build infrastructure to collect, transform, combine, and distribute customer data.
- Participate in regular Scrum ceremonies and mentor junior team members on industry best practices.
Required Skills
- 5+ years of experience in data engineering roles.
- Proficiency with AWS cloud services.
- Strong programming skills in Python.
- Advanced SQL skills for complex querying and reporting.
- Experience with Spark for big data processing.
- Hands-on experience with Airflow for orchestration.
- Working knowledge of Snowflake.
- Experience handling data from RDBMS and REST HTTP APIs.
- Ability to implement business logic within data platforms.
Preferred Skills
- Experience with data visualization tools to present analytical results to stakeholders.