Description
You will apply statistical and machine learning techniques to solve complex problems using structured and unstructured data. You will own the full lifecycle of data models, from hypothesis formulation to deployment and quality assurance.
Responsibilities
- Develop and deploy statistical applets and predictive models using Python, R, and SQL.
- Perform text mining and analysis on large, complex datasets to drive strategic business decisions.
- Formulate hypotheses, test conclusions, and present complex statistical concepts to non-analytical stakeholders using Excel, Word, and PowerPoint.
- Execute quality assurance activities and develop test cases for data models in Linux and AWS cloud environments.
- Use C, C++, or an object-oriented language to improve computational efficiency.
Required Skills
- 5+ years of experience in data science, machine learning, or related analytical roles.
- Proficiency in Python, R, and SQL for data manipulation and modeling.
- Strong knowledge of descriptive and inferential statistics.
- Hands-on experience with text mining and processing large-scale datasets.
- Familiarity with Linux, AWS, cloud computing, and high-performance computing environments.
- Proficiency with data visualization tools such as Spotfire or Tableau.
- B.Tech degree in a relevant field.
- Experience with C, C++, or an object-oriented programming language.
Preferred Skills
- Prior experience in pharmaceutical research and development.
- Coursework in chemistry, biology, or engineering.