Pipeline Development: Design and maintain ETL/ELT pipelines that integrate data from diverse sources, such as Amazon S3, Amazon RDS, and external APIs, into Amazon Redshift.
Data Transformation: Use Redshift stored procedures to extract, load, and transform data.
Logic Encapsulation: Write Redshift stored procedures (in PL/pgSQL) that group multiple SQL statements into a single unit, so that the statements succeed or fail together as one atomic transaction.
Orchestration: Automate workflows using AWS Step Functions or Amazon MWAA (Airflow).
Performance Tuning: Optimize Redshift query performance through appropriate distribution styles, sort keys, and compression.
Maintenance & Documentation: Support, troubleshoot, and document ETL workflows, including data mapping, transformation rules, and technical designs.
Preferred Skills and Experience
Scripting: Strong scripting abilities in Python or UNIX shell for file processing and automation.
Additional Required Qualifications
Bachelor’s degree or foreign equivalent from an accredited institution is required. Three years of progressive experience in the specialty will also be considered in lieu of each year of education.