Description
You will design and lead scalable data architectures and pipelines to support analytics and business intelligence.
Responsibilities
- Design and maintain scalable data pipelines and architectures.
- Lead data projects and enforce engineering best practices.
- Optimize data systems for analytics and reporting.
- Ensure data quality and system reliability in production environments.
Required Skills
- 8+ years of IT experience.
- 5+ years of experience with Python, PySpark, and SQL for big data processing.
- Experience with data lakes using Iceberg format.
- Proficiency in ETL processes using Informatica.
- Hands-on experience with AWS services including S3, Glue, Redshift, Lambda, EMR, and Airflow.
- Experience with Postgres and BASH/Shell scripting.
- Background working with healthcare data.
- Experience leading data teams.
- Practical experience in Agile development.