Description
You will own the end-to-end data management strategy and pipeline execution.
Responsibilities
- Define and execute the data management vision, strategy, and best practices.
- Design, implement, and maintain scalable data pipelines for large-scale datasets.
- Implement data governance policies to ensure data accuracy and compliance.
- Partner with engineering and product teams to make data accessible for decision-making.
- Lead and develop a team of data engineers and analysts.
Required Skills
- 10+ years of experience managing data infrastructure and optimizing pipelines at scale.
- Strong hands-on experience with AWS services (S3, Lambda, Redshift, Glue).
- Proficiency in Python and PySpark for data processing and pipeline development.
- Expertise in using Apache Airflow and Apache Iceberg.
- Solid understanding of data privacy, quality control, and governance best practices.
- Proven ability to lead, mentor teams, and influence stakeholders on data initiatives.
- Strong problem-solving abilities with a data-driven approach.
- Experience with designing and optimizing data workflows.