You will design and maintain cloud-based data warehouse and lake infrastructure on AWS.
Responsibilities
Develop ETL processes to integrate retail datasets into a unified data model.
Create QuickSight dashboards to provide retailer insights.
Enable self-serve analytics and support ad-hoc reporting requirements.
Implement real-time monitoring and alerting for operational data.
Mentor junior engineers and manage multiple projects across teams.
Required Skills
5+ years of data engineering experience.
Expert-level Python skills.
Extensive experience with AWS Big Data tools: S3, Redshift, QuickSight, Glue, Lake Formation, EMR/Spark, Kinesis, Firehose, Kafka, Athena, Lambda, and Step Functions.
Proficiency with IAM roles, permissions, and data security best practices including encryption and governance.
Knowledge of API authorization standards such as OAuth 2.0, JWT, and OWASP Top 10.
Experience with big data file formats including Parquet, Avro, and ORC.
Proficiency in software testing strategies covering unit, integration, and end-to-end tests.
Understanding of distributed systems for data storage and computing.
Proficiency with non-relational databases and data stores.