Build and maintain scalable, automated data pipelines for log and event-streaming data on cloud platforms. Design data models and architectures that support analytics and machine learning initiatives.
Responsibilities
- Develop reusable frameworks and new capabilities to improve data processing efficiency.
- Ensure data quality, security, and reliability across all data solutions.
- Collaborate with data scientists and stakeholders to define data needs and deliver actionable insights.
- Optimize enterprise-scale data pipelines using modern tools such as Spark and Airflow.
- Apply data governance principles to support business-oriented problem solving.
Required Skills
- Bachelor's degree in Computer Science or a related field with 4+ years of experience, or a Master's degree with 3+ years.
- Experience building and optimizing enterprise-scale data pipelines.
- Proficiency in Spark, Airflow, and Azure.
- Strong command of SQL and Python for data transformation and automation.
- Deep understanding of data architecture, data modeling, and data governance.