Description

You will focus on backend development using the AWS Data Integration and Storage tech stack to build and automate data pipelines.

Responsibilities

  • Design and implement scalable ETL processes and data pipelines using AWS Glue, AWS Lambda, and Apache Spark.
  • Transition existing AWS data pipelines and procedures developed for the Department of Health and Human Services.
  • Develop and maintain data models, schemas, and transformation logic for data warehousing and analytics.
  • Implement data ingestion from various sources into Amazon S3, Amazon Redshift, or Amazon DynamoDB.
  • Optimize pipelines for performance and cost-efficiency while monitoring for bottlenecks and consistency issues.
  • Collaborate with Data Engineers, Analysts, and Report Developers to automate recurring data requests and troubleshoot issues.

Required Skills

  • 5+ years of professional experience as a Data Engineer with a focus on AWS services.
  • Expertise in AWS Glue, AWS Lambda, and AWS Spark.
  • Hands-on experience with AWS Data Migration Services, AWS RDS, Amazon S3, Amazon Redshift, and Amazon DynamoDB.
  • Strong proficiency in SQL, Microsoft SQL Server, and MySQL.
  • Experience with Microsoft SQL Server Integration Services (SSIS).
  • Proficiency in programming languages such as Python, Scala, or Java for data processing.
  • Deep understanding of data modeling, data warehousing, and data integration concepts.
  • Ability to follow standard practices for migrating changes to test and production environments.
  • Bachelor’s or master’s degree in computer science, Data Engineering, or a related field.

Education

Any Graduate