Description

You will develop and maintain ETL workflows using Pentaho to support new solutions and modernize existing data processes.

Responsibilities

  • Design, develop, and maintain ETL workflows in Pentaho (PDI) by re-engineering processes from Shell scripts or Java.
  • Implement scalable data integration pipelines in Pentaho to meet evolving business needs.
  • Analyze and reverse-engineer existing data workflows to build equivalent solutions in Pentaho.
  • Support the migration of on-prem or custom data solutions to Azure Cloud, integrating services like Azure Blob Storage and ADF.
  • Develop parameterized, modular Pentaho transformations and jobs while ensuring data quality and error handling.

Required Skills

  • 5+ years of experience refactoring legacy scripts into ETL jobs using visual tools like Pentaho.
  • Strong background in data processing workflows implemented in Shell scripts or Java.
  • Hands-on experience with Azure Data Factory, Azure Blob Storage, Azure SQL, and Azure Key Vault.
  • Proficiency in SQL, stored procedures, and performance tuning across Oracle, PostgreSQL, and SQL Server.
  • Experience handling integration via APIs, CSV, JSON, and XML sources/targets.
  • Familiarity with Git/GitLab for version control and CI/CD concepts for ETL.
  • Experience with data validation, audit logging, and implementing data retention policies.
  • Knowledge of Agile methodologies and tracking tasks in JIRA.

Education

Any Graduate