Description

You will lead data engineering efforts for EDP cloud modernization and source data migration. You will design, build, and manage data pipelines within a Google Cloud data lake environment. Your work will involve migrating legacy data to Google Cloud Platform (GCP), optimizing analytics queries, and ensuring a robust data architecture.

Responsibilities

  • Build and manage data pipelines for ingestion, processing, and transformation using Google Cloud services.
  • Design and implement scalable data architecture within a data lake environment.
  • Execute complex data migration tasks from legacy systems (Teradata, Oracle) to Google Cloud Platform.
  • Develop analytical queries to solve complex data problems and support business intelligence.
  • Use ETL/ELT tools and Unix Shell scripting to automate data workflows.

Required Skills

  • 8 to 10 years of experience in data engineering roles.
  • Proficiency in Google Cloud services: BigQuery, Dataproc, Cloud Storage, Dataflow, Pub/Sub, and Cloud Composer.
  • Strong programming skills in Python, PySpark, and SQL.
  • Hands-on experience with Hadoop and Hive ecosystems.
  • Experience with relational databases such as Teradata or Oracle.
  • Experience with ETL/ELT tools like Informatica.
  • Ability to work with NoSQL databases such as MongoDB.
  • Unix Shell scripting experience.
  • Any graduate degree.

Preferred Skills

  • Experience with cloud migration projects.
  • Healthcare domain knowledge.

Education

Any graduate degree.