Description
Lead the design and implementation of VA claims processing data systems while mentoring data engineers. You will architect enterprise-level solutions from prototyping through production deployment, ensuring compliance with federal regulations.
Responsibilities
- Architect and build production-grade ETL/ELT pipelines, data transformations, and integration frameworks using Python and SQL.
- Design conceptual, logical, and physical data models, actively building database schemas and table structures.
- Optimize data warehouse and data processing workloads on PySpark, AWS Glue, AWS Redshift, and Google BigQuery for performance and cost.
- Build and deploy CI/CD pipelines for data infrastructure, provisioning AWS and GCP environments with Terraform or CloudFormation.
- Establish engineering best practices, code review standards, and technical documentation across the team.
Required Skills
- 10+ years of progressive data engineering experience with hands-on technical leadership.
- Recent hands-on experience (within the past 2-3 years) building and deploying production data systems.
- Strong programming skills in Python and SQL with a focus on scalable, maintainable codebases.
- Expertise in cloud-based data warehousing (AWS Redshift, BigQuery) and ETL/ELT tools.
- Proficiency in infrastructure as code using Terraform and CloudFormation.
- Hands-on implementation experience with relational and non-relational database technologies.
- Familiarity with VA data standards, FHIR, and federal and state compliance requirements such as HIPAA and CCPA.
Preferred Skills
- Experience with greenfield projects, federal healthcare claims environments, or legacy system migrations.
- Knowledge of streaming data platforms (Kafka, Kinesis) and real-time processing.