Description
You will design and build secure, performant data architectures and end-to-end data pipelines for structured and unstructured data.
Responsibilities
- Own end-to-end data projects from initial architecture and pipeline construction to data processing and dashboard building.
- Lead project teams of data engineers to develop reliable, cost-effective solutions for data systems and models.
- Design, code, and integrate data subsystems, ensuring scalability within complex distributed systems.
- Write and execute complete testing plans, debugging code and integration issues within the data system architecture.
- Mentor junior staff and communicate technical design proposals and progress to management and cross-functional teams.
Required Skills
- 4 to 6 years of experience in data engineering.
- Proficiency in Python and Linux shell scripting.
- Hands-on experience with Databricks and PySpark.
- Experience with Node.js and NoSQL and relational database systems.
- At least 1 year of experience with AWS, specifically Redshift and S3.
- Strong understanding of complex, distributed, and massively parallel systems.
- Ability to manage data pipelines across multiple systems.
- Experience with database architecture testing, debugging, and script execution.
- Bachelor's or Master's degree in Computer Science, Information Systems, Engineering, or a related field.
Preferred Skills
- Experience with AWS EC2 and Lambda.
- Familiarity with BI visualization tools such as Looker, Power BI, or Tableau.