Description
You will design and build data pipelines to manage and transform healthcare information.
Responsibilities
- Design and implement ETL/ELT pipelines processing raw XML and JSON from parallel data sources.
- Create DDL scripts, database tables, and implement complex logic for data transformation.
- Validate and profile data, ensuring data quality meets downstream process requirements.
- Develop and maintain data integration logic using Azure Data Factory (ADF).
Required Skills
- 8+ years of experience as a Data Engineer with a strong foundation in data analysis and engineering principles.
- 5+ years experience in healthcare data, including quality assurance, working with HL7 and FHIR standards.
- Proficiency in SQL development, including handling data arrays and LATERAL joins.
- Expertise in Azure Data Factory (ADF) and related Azure technologies.
- Hands-on experience with XML and JSON data formats.
- Strong proficiency in Python Scripting for data manipulation.
- Experience as a SQL Developer transitioning into a Data Engineering role.
- Familiarity with data validation and profiling techniques.
Preferred Skills
- Experience automating web applications using Selenium WebDriver with Python scripting.
- Hands-on experience with Snowflake for data warehousing.