Description

You will implement and operationalize modern AI-enabled data capabilities on Google Cloud to ingest, transform, and distribute data for big data applications. You will leverage AI and agentic frameworks to automate data management, governance, and consumption, including pipelines, data quality, metadata, and compliance. You will collaborate with principal engineers and product managers to roadmap and deliver key data capabilities based on organizational priorities. You will support the migration from on-premises systems to cloud platforms, ensuring a seamless transition and integration. You will develop and support data flows using Kafka, Flink, Spark Streaming, and other modern processing tools.

Responsibilities

  • Implement and operationalize AI-enabled data capabilities on Google Cloud.
  • Leverage AI and agentic frameworks to automate data management, governance, and consumption.
  • Collaborate with engineering and product teams to roadmap data capabilities.
  • Support migration from on-premises systems to cloud platforms.
  • Develop and support data flows using Kafka, Flink, and Spark Streaming.

Required Skills

  • 5+ years of experience in data engineering with cloud solutions.
  • Hands-on experience with Spark, Kafka, Airflow, Google Cloud Storage, BigQuery, Dataproc, and Cloud Composer.
  • Experience with AI tools and techniques such as LangChain, LangGraph/ADK, agentic frameworks, RAG, GraphRAG, and MCP.
  • Deep understanding of cloud-based data lakes, warehouses, and automated data pipelines.
  • Experience developing data flows and automations in large data environments.

Preferred Skills

  • Relevant cloud certifications such as Google Cloud Professional Data Engineer, Microsoft Azure Data Engineer Associate, or AWS Certified Data Analytics - Specialty.

Education

Any Graduate