You will lead data engineering efforts for EDP cloud modernization and source data migration. You will design, build, and manage data pipelines within a Google Cloud data lake environment. Your work involves migrating legacy data to GCP, optimizing analytics queries, and ensuring robust data architecture.
Responsibilities
- Build and manage data pipelines for ingestion, processing, and transformation using Google Cloud services.
- Design and implement scalable data architecture within a data lake environment.
- Execute complex data migration tasks from legacy systems (Teradata, Oracle) to Google Cloud Platform.
- Develop analytical queries to solve complex data problems and support business intelligence.
- Utilize ETL/ELT tools and Unix Shell scripting to automate data workflows.
Required Skills
- 8-10 years of experience in data engineering roles.
- Proficiency in Google Cloud services: BigQuery, Dataproc, Cloud Storage, Dataflow, Pub/Sub, and Cloud Composer.
- Strong programming skills in Python, PySpark, and SQL.
- Hands-on experience with Hadoop and Hive ecosystems.
- Experience with relational databases such as Teradata or Oracle.
- Experience with ETL/ELT tools like Informatica.
- Ability to work with NoSQL databases such as MongoDB.
- Unix Shell scripting experience.
- Any graduate degree.
Preferred Skills
- Experience with cloud migration projects.
- Healthcare domain knowledge.