You will build and deploy machine learning models and data pipelines within a Databricks environment.
Responsibilities
- Design, develop, and deploy machine learning models using Databricks.
- Build and optimize data pipelines and workflows using PySpark and SQL.
- Analyze large, complex datasets to extract patterns and trends.
- Collaborate with data engineers, analysts, and stakeholders to deliver insights.
- Document processes, models, and code for reproducibility.
Required Skills
- 5+ years of experience in data science or a related role.
- Hands-on experience with Databricks, including notebooks, clusters, and jobs.
- Proficiency in Python, Scala, and SQL.
- Experience with big data tools including Spark and Delta Lake.
- Strong understanding of machine learning algorithms and statistical methods.
- Expertise in PySpark and SQL for building and optimizing data pipelines.
Preferred Skills
- Experience with a cloud platform such as Azure, AWS, or GCP.
- Familiarity with MLflow and Databricks REST APIs.
- Experience with data visualization tools like Power BI or Tableau.