Description
Data Scientist responsible for developing and maintaining machine learning models in production, with a focus on NLP and Large Language Models.
Responsibilities
- Develop, deploy, and maintain ML models, including NLP use cases like text classification, sentiment analysis, and conversational AI.
- Design and implement solutions using LLMs (BERT, GPT, RoBERTa), applying fine-tuning, prompt engineering, and Retrieval-Augmented Generation (RAG).
- Perform data preprocessing, feature engineering, and model evaluation on large-scale structured and unstructured datasets.
- Collaborate with cross-functional teams to translate business problems into ML solutions and present insights to technical and non-technical stakeholders.
- Continuously improve model performance, scalability, and efficiency.
Required Skills
- 5+ years of experience in Data Science, Machine Learning, or AI-focused roles.
- Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field.
- Strong proficiency in Python (Pandas, NumPy, Scikit-learn).
- Hands-on experience with TensorFlow or PyTorch.
- Proven experience with NLP and ML model development, including a strong understanding of AI/ML principles.
- Experience working with LLMs (BERT, GPT, RoBERTa), fine-tuning, prompt engineering, and RAG.
- Experience with cloud platforms (AWS, Azure, or GCP).
- Strong SQL skills and experience handling large datasets.
- Excellent communication and stakeholder management skills.
Preferred Skills
- Experience with MLOps (deployment, monitoring, lifecycle management) and familiarity with CI/CD, Docker, and Kubernetes.
- Experience with model explainability (XAI), data governance, privacy, and compliance.
- Knowledge of AI automation tools/frameworks and exposure to telecom or customer interaction data.