Description

You will build and manage data pipelines using modern orchestration tools.

Responsibilities

  • Develop and maintain data pipeline orchestration using Apache Airflow.
  • Implement data processing logic using Python scripting for DAGs.
  • Design and manage data flows within various environments including cloud and on-premises.
  • Execute tests, analyze data, and resolve defects in production systems.

Required Skills

  • 5+ years of hands-on experience with Apache Airflow.
  • Proficiency in Python scripting for data workflows.
  • Experience with data pipeline orchestration best practices.
  • Familiarity with at least one major cloud platform (AWS, GCP, or Azure).
  • Understanding of CI/CD pipelines and version control (Git).
  • Experience with SQL and data warehousing concepts.
  • Exposure to Apache Kafka, topics, and partitions.
  • Ability to develop, execute tests, and troubleshoot data issues.

Preferred Skills

  • Experience with Apache NiFi.
  • Exposure to Data Lakes.
  • Familiarity with Kafka Connect, Schema Registry, or Kafka Streams.

Education

Any Graduate