You will build and maintain data pipelines and ETL processes to manage large-scale datasets.
Responsibilities
- Design and test data architectures to align with business requirements.
- Implement and optimize data models for efficient querying and reporting.
- Develop and maintain data quality checks and monitoring processes.
- Contribute to the alignment of data architecture with organizational solutions.
Required Skills
- 9+ years of overall IT experience.
- Minimum 4 years of hands-on experience with Python and Spark.
- Strong SQL capabilities.
- Experience managing big data using ETL tools like Informatica.
- Proficiency with AWS services: S3, Redshift, Lambda, EMR, Airflow, Postgres, SNS, and Event Bridge.
- Experience with Bash/Shell scripting.
- Any Graduate degree.
Preferred Skills
- Experience with Kafka, MuleSoft API, and Iceberg format in a Data Lakehouse architecture.
- Understanding of healthcare data systems and Agile methodologies.