GCP Data Engineer

Prime Vector Consulting
Irving, TX, USA

Description

You will lead data engineering efforts for EDP cloud modernization and source data migration. You will design, build, and manage data pipelines within a Google Cloud data lake environment. Your work involves migrating legacy data to GCP, optimizing analytics queries, and ensuring robust data architecture.

Responsibilities

Build and manage data pipelines for ingestion, processing, and transformation using Google Cloud services.
Design and implement scalable data architecture within a data lake environment.
Execute complex data migration tasks from legacy systems (Teradata, Oracle) to Google Cloud Platform.
Develop analytical queries to solve complex data problems and support business intelligence.
Utilize ETL/ELT tools and Unix Shell scripting to automate data workflows.

Required Skills

8-10 years of experience in data engineering roles.
Proficiency in Google Cloud services: BigQuery, Dataproc, Cloud Storage, Dataflow, Pub/Sub, and Cloud Composer.
Strong programming skills in Python, PySpark, and SQL.
Hands-on experience with Hadoop and Hive ecosystems.
Experience with relational databases such as Teradata or Oracle.
Experience with ETL/ELT tools like Informatica.
Ability to work with NoSQL databases such as MongoDB.
Unix Shell scripting experience.
Any graduate degree.

Preferred Skills

Experience with cloud migration projects.
Healthcare domain knowledge.

Key Skills

SQL Hadoop Hive PySpark Python

Education

ANY GRADUATE

Apply Now

Back To Jobs

Posted On: 8 days Ago
Category: Data Engineer
Tenure: Full-Time Position