Description

You will build and deploy machine learning models and data pipelines within a Databricks environment.

Responsibilities

  • Design, develop, and deploy machine learning models using Databricks.
  • Build and optimize data pipelines and workflows using PySpark and SQL.
  • Analyze large, complex datasets to extract patterns and trends.
  • Collaborate with data engineers, analysts, and stakeholders to deliver insights.
  • Document processes, models, and code for reproducibility.

Required Skills

  • 5+ years of experience in data science or a related role.
  • Hands-on experience with Databricks, including notebooks, clusters, and jobs.
  • Proficiency in Python, Scala, and SQL.
  • Experience with big data tools including Spark and Delta Lake.
  • Strong understanding of machine learning algorithms and statistical methods.
  • PySpark and SQL expertise.

Preferred Skills

  • Experience with cloud platforms including Azure, AWS, or GCP.
  • Familiarity with MLflow and Databricks REST APIs.
  • Experience with data visualization tools like Power BI or Tableau.

Education

Any Graduate