Description

You will build and maintain data pipelines and ETL processes to manage large-scale datasets.

Responsibilities

  • Design and test data architectures to align with business requirements.
  • Implement and optimize data models for efficient querying and reporting.
  • Develop and maintain data quality checks and monitoring processes.
  • Contribute to the alignment of data architecture with organizational solutions.

Required Skills

  • 9+ years of overall IT experience.
  • Minimum 4 years of hands-on experience with Python and Spark.
  • Strong SQL capabilities.
  • Experience managing big data using ETL tools like Informatica.
  • Proficiency with AWS services: S3, Redshift, Lambda, EMR, Airflow, Postgres, SNS, and Event Bridge.
  • Experience with Bash/Shell scripting.
  • Any Graduate degree.

Preferred Skills

  • Experience with Kafka, MuleSoft API, and Iceberg format in a Data Lakehouse architecture.
  • Understanding of healthcare data systems and Agile methodologies.

Education

Any Graduate