You will build and maintain scalable, automated data processes and pipelines within a Google Cloud Platform environment.
Responsibilities
- Establish automated processes for data analysis, model development, validation, and implementation.
- Collaborate with analysts and data scientists to manage downstream impacts on data models.
- Write efficient, well-organized software for iterative release environments.
- Develop data pipelines that prepare information for ingestion and consumption.
- Maintain and optimize databases and filesystems for production reporting and analytics.
Required Skills
- 5+ years of experience in data engineering.
- Proficiency in Python and object-oriented or functional scripting languages.
- Strong experience with relational SQL databases.
- Experience with Google Cloud Platform (GCP).
- Hands-on experience with Airflow for pipeline and workflow management.
- Working knowledge of dbt for data transformation.
- Proficiency with Git and GitHub.
- Ability to work in Linux environments.
- Experience with software engineering practices, including unit testing, code reviews, and design documentation.