You will build and manage data pipelines and ETL processes within Google Cloud Platform.
Responsibilities
- Build and maintain ETL processes to move data across environments.
- Implement streaming data pipelines into BigQuery and other GCP services.
- Develop applications that connect to remote APIs.
- Collaborate with Operations teams to tune existing and new architectures.
- Align Google best practices with specific customer requirements for analytics delivery.
Required Skills
- 6+ years of experience in Data Analytics and Big Data.
- Strong Python programming skills.
- Experience with GCP products: BigQuery, GCS, Cloud Dataflow, Cloud Pub/Sub, and Cloud Bigtable.
- Hands-on experience with ETL implementation and data pipeline development.
- Proficiency with relational databases such as PostgreSQL and MySQL.
- Experience with NoSQL systems including Redis, Cassandra, and MongoDB.
- Understanding of real-time streaming and processing for logs, time series, and unstructured data.
Preferred Skills
- Experience migrating Data Warehouses or Hadoop projects from on-prem to GCP using DataProc.