Description

You will build and maintain data-focused software systems and analytical pipelines.

Responsibilities

  • Implement and maintain CI/CD pipelines using Azure DevOps, GitLab, Travis, or Jenkins.
  • Develop server-side applications using Python-based frameworks such as FastAPI or Django.
  • Integrate data from semi-structured and unstructured sources through ETL/ELT frameworks.
  • Optimize Spark clusters and manage data processing workflows.
  • Perform data cleaning, enrichment, and reporting for business users.

Required Skills

  • 5-8+ years of experience in data-focused software engineering.
  • 2+ years of experience in data analysis, AI, or data science work.
  • Extensive experience with Python, (py)Spark, JSON, and SQL.
  • Proficiency with Azure DevOps and modern SDLC practices including unit, integration, and UA testing.
  • Strong knowledge of SQL and familiarity with modern data stores and NoSQL database systems.
  • Experience with data visualization tools such as PowerBI or Tableau.
  • Experience with web scraping and crowdsourcing techniques.
  • Master’s Degree or Ph.D. in data science, computer science, or a related quantitative field, or equivalent experience.

Preferred Skills

  • Experience or strong interest in implementing LLMs in a productized context.

Education

Bachelor's degree in Computer Science