You will build and maintain scalable Big Data applications on distributed platforms.
Responsibilities
- Develop scalable Big Data applications and solutions on distributed platforms.
- Architect data products using Streaming, Serverless, and Microservices Architecture.
- Design and implement data pipelines utilizing EMR, Airflow, and Databricks.
- Configure Jenkins pipelines for CI/CD processes targeting Managed Spark jobs.
- Partner with teams to solve complex problems and influence stakeholders.
Required Skills
- 10+ years developing scalable Big Data applications.
- Proficiency in distributed technologies including Spark, Python, and Scala.
- Experience with AWS services: S3, Managed Airflow, EMR/EC2, IAM.
- Hands-on experience with Data warehousing tools: SQL database, Presto, and Snowflake.
- Experience architecting data products using Data platforms like EMR and Airflow.
- Familiarity with CI/CD practices, including building Docker images.
- Working knowledge of Data modelling, Data Governance, and Data Architecture.
- Experience with reporting tools such as Tableau and Quicksite.
- Bachelor's degree in Computer Science or a relevant field.