Description
You will apply advanced machine learning and NLP techniques to solve complex data problems.
Responsibilities
- Implement machine learning algorithms including dimensionality reduction, clustering, embeddings, and sequence classification.
- Develop and deploy NLP solutions using large language models, prompt engineering, and fine-tuning.
- Build and maintain data models using relational, NoSQL, and vector databases.
- Benchmark model performance and manage end-to-end model lifecycles.
Required Skills
- 10+ years of professional experience in data science.
- Strong Python programming background.
- Deep learning expertise with PyTorch, TensorFlow, Keras, and Hugging Face Transformers.
- Practical NLP experience with spaCy, word2vec, Flair, and BERT.
- LLM orchestration experience using LangChain and LlamaIndex.
- Proficiency with vector stores such as ChromaDB and search engines like Elasticsearch or OpenSearch.
- Experience with relational and NoSQL databases, including Postgres.
- Knowledge of cloud platforms including AWS, GCP, or Azure.
- Understanding of data modeling principles and complex data structures.
Preferred Skills
- Experience with fine-tuning and benchmarking large language models.