You will design and deliver data solutions that translate complex mathematical algorithms into usable computational methods.
Responsibilities
Build analytical and statistical models to answer key business questions.
Develop and implement MLOps pipelines and machine learning models.
Design dashboards and visualizations to communicate data insights.
Optimize query performance and tuning across large, complex datasets.
Propose methodologies to drive increased efficiency in client workflows.
Required Skills
Advanced SQL proficiency with 5+ years of experience.
2+ years of experience working with dbt.
5+ years of experience working with relational databases including PostgreSQL, Oracle, Impala, or Cloudera.
Intermediate Python skills including pandas, numpy, scikit-learn, and tensorflow.
Experience with machine learning, rule-based systems, NLP, and computational linguistics.
Dashboard development experience using Tableau, Spotfire, or DASH.
Experience with data mining, ETL, and BI tools.
Proficiency in Git via command line and LINUX/UNIX environments.
MS in Computer Science, Chemical Engineering, or Biostatistics with 6 years of industry experience, or a PhD in a related field with 3 years of experience.
Preferred Skills
Experience with JupyterLabs, Anaconda, RStudio, or Domino Data Lab.
Knowledge of GxP requirements and bioprocess engineering/cell therapy data.