You will design and implement data pipelines and architectures within the Google Cloud Platform ecosystem.
Responsibilities
- Build and maintain ETL processes and distributed architectures for big data workloads.
- Design data mapping and modeling frameworks for structured and unstructured data sources.
- Manage data warehousing and business intelligence modeling requirements.
- Develop scalable data solutions using both batch and streaming services.
Required Skills
- 8+ years of experience in data engineering, ETL, and enterprise data warehousing (EDW).
- Expertise in Python development.
- Hands-on experience with Google Cloud BigQuery.
- Extensive experience writing SQL across various database systems.
- Proficiency in one or more of the following: JavaScript, Java, R, UNIX shell, PHP, or Ruby.
- Experience with cloud analytics and both structured and unstructured data.
- Knowledge of data mining, cloud computing, and data management tools.
- Experience with Hadoop, HDFS, MapR, or Spark.
Preferred Skills
- Experience with Dataproc and Dataflow using Java on GCP.
- Familiarity with serverless data warehousing and Google Cloud services such as Cloud Storage, Cloud Dataflow, DFunc, and Bigtable.