You will develop and manage data processing frameworks and production-level pipelines within the Google Cloud ecosystem.
Responsibilities
- Build and deploy reusable Dataflow Flex templates using Apache Beam for batch and stream processing.
- Design and implement production-level data pipelines using Kafka and Event Driven Architecture.
- Manage real-time streaming of database changes and events via Pub/Sub, Kafka, or Confluent Kafka.
- Configure and maintain Kafka Connect using various connector types including HTTP REST proxy, JMS, File, SFTP, and JDBC.
Required Skills
- 6-8 years of experience as a Java Apache Beam Developer.
- Strong proficiency in Java and Google Cloud Platform (GCP).
- Deep knowledge of Dataflow (Apache Beam) for batch and stream processing.
- Hands-on experience with Google Cloud SQL, BigQuery, and Apigee.
- Experience with Kafka, Confluent Kafka, and Pub/Sub for real-time streaming.
- Expertise in Event Driven Architecture and the Kafka Connect framework.
- Working knowledge of BigTable, Cloud Storage, Compute Engine, Cloud Function, Cloud Run, and App Engine.
- Experience with Apache Hadoop family distributed storage and processing utilities.
Preferred Skills
- In-depth understanding of Google product technology and underlying architectures.