Description
You will design and build data solutions to support analytical and reporting needs.
Responsibilities
- Design and build data marts, curated layers, and reusable data models for analytical and reporting needs.
- Develop high-performance ETL/ELT pipelines using Azure Databricks, PySpark, Python, and SQL.
- Work with large-scale structured and unstructured datasets using distributed computing techniques.
- Implement data transformation workflows aligned to medallion architecture (Bronze → Silver → Gold).
- Ensure data quality, lineage, security, and compliance with enterprise standards.
Required Skills
- 7+ years of experience in Data Engineering.
- Strong expertise in Python, PySpark, and advanced SQL.
- Hands-on experience building large-scale data pipelines, data lakes, and data marts.
- Experience with Azure Databricks, Azure Data Factory, ADLS, Synapse, and Key Vault.
- Deep understanding of Delta Lake, distributed computing, and performance optimization.
- Knowledge of relational and dimensional modeling, star schemas, and Slowly Changing Dimensions (SCDs).
- Proficiency with CI/CD pipelines.
- Experience with data quality, metadata, and governance practices.