You will design, build, and maintain data pipelines using cloud services and data processing frameworks.
Responsibilities
- Design, develop, and maintain data pipelines using AWS services and Databricks.
- Utilize Python and PySpark for data transformation and processing.
- Implement data modeling and warehousing solutions for analytics.
- Write complex SQL queries for data extraction and performance tuning.
- Monitor and optimize data workflows for efficiency and reliability.
Required Skills
- 5+ years of professional experience.
- Expertise in PySpark and Python.
- Proficiency in SQL and Data Modelling.
- Hands-on experience with AWS.
- Experience with Databricks.
- Knowledge of data pipeline design and ETL best practices.
- Familiarity with data governance and compliance.
Preferred Skills
- Experience with AWS Cloud or AWS-Amazon Web Services.
- Familiarity with Azure Databricks or PL/SQL.