You will develop and maintain ETL workflows using Pentaho to support new solutions and modernize existing data processes.
Responsibilities
- Design, develop, and maintain ETL workflows in Pentaho (PDI) by re-engineering processes from Shell scripts or Java.
- Implement scalable data integration pipelines in Pentaho to meet evolving business needs.
- Analyze and reverse-engineer existing data workflows to build equivalent solutions in Pentaho.
- Support the migration of on-prem or custom data solutions to Azure Cloud, integrating services like Azure Blob Storage and ADF.
- Develop parameterized, modular Pentaho transformations and jobs while ensuring data quality and error handling.
Required Skills
- 5+ years of experience refactoring legacy scripts into ETL jobs using visual tools like Pentaho.
- Strong background in data processing workflows implemented in Shell scripts or Java.
- Hands-on experience with Azure Data Factory, Azure Blob Storage, Azure SQL, and Azure Key Vault.
- Proficiency in SQL, stored procedures, and performance tuning across Oracle, PostgreSQL, and SQL Server.
- Experience handling integration via APIs, CSV, JSON, and XML sources/targets.
- Familiarity with Git/GitLab for version control and CI/CD concepts for ETL.
- Experience with data validation, audit logging, and implementing data retention policies.
- Knowledge of Agile methodologies and tracking tasks in JIRA.