Description
You will design and implement scalable data architecture and analytics solutions to support real-time and batch-based consumer data needs.
Responsibilities
- Establish best practices for data ingestion, integration, and access patterns.
- Lead the development of scalable data architecture for business and analytic use cases.
- Drive continuous data transformation to minimize technical debt.
- Create test protocols, test scripts, and validation deliverables.
- Provide technical support for data pipelines and advanced analytics solutions to local end users.
Required Skills
- 5+ years of experience designing and implementing complex data systems from the ground up.
- Proficiency in Python, SQL, and Spark.
- Experience building batch and streaming pipelines using PySpark and Pandas.
- Ability to develop and optimize machine learning models to extract insights from complex data.
- Experience transforming data using SQL, NoSQL, and Python.
- Experience with data visualization using Python and R.
- Working knowledge of cloud services in AWS or Microsoft Azure.
- Experience with OpenShift, EKS, ECS, and Databricks.
- Any Graduate degree.