Description

You will build and maintain scalable Big Data applications on distributed platforms.

Responsibilities

  • Develop scalable Big Data applications and solutions on distributed platforms.
  • Architect data products using streaming, serverless, and microservices architectures.
  • Design and implement data pipelines utilizing EMR, Airflow, and Databricks.
  • Configure Jenkins CI/CD pipelines for managed Spark jobs.
  • Partner with teams to solve complex problems and influence stakeholders.

Required Skills

  • 10+ years developing scalable Big Data applications.
  • Proficiency in distributed technologies including Spark, Python, and Scala.
  • Experience with AWS services: S3, Managed Airflow, EMR/EC2, IAM.
  • Hands-on experience with data warehousing tools: SQL databases, Presto, and Snowflake.
  • Experience architecting data products on data platforms such as EMR and Airflow.
  • Familiarity with CI/CD practices, including building Docker images.
  • Working knowledge of data modeling, data governance, and data architecture.
  • Experience with reporting tools such as Tableau and QuickSight.
  • Bachelor's degree in Computer Science or a relevant field.

Education

Bachelor's degree