You will build and maintain data-focused software systems and analytical pipelines.
Responsibilities
- Implement and maintain CI/CD pipelines using Azure DevOps, GitLab, Travis, or Jenkins.
- Develop server-side applications using Python-based frameworks such as FastAPI or Django.
- Integrate data from semi-structured and unstructured sources through ETL/ELT frameworks.
- Optimize Spark clusters and manage data processing workflows.
- Perform data cleaning, enrichment, and reporting for business users.
Required Skills
- 5-8+ years of experience in data-focused software engineering.
- 2+ years of experience in data analysis, AI, or data science work.
- Extensive experience with Python, (py)Spark, JSON, and SQL.
- Proficiency with Azure DevOps and modern SDLC practices including unit, integration, and UA testing.
- Strong knowledge of SQL and familiarity with modern data stores and NoSQL database systems.
- Experience with data visualization tools such as PowerBI or Tableau.
- Experience with web scraping and crowdsourcing techniques.
- Master’s Degree or Ph.D. in data science, computer science, or a related quantitative field, or equivalent experience.
Preferred Skills
- Experience or strong interest in implementing LLMs in a productized context.