Description
Design, build, and deploy machine learning and NLP applications with a heavy focus on Generative AI and Large Language Models.
Responsibilities
- Develop and implement solution architectures for Generative AI, conversational AI chatbots, and cloud AIaaS.
- Train, fine-tune, and deploy LLMs using instruction tuning, RLHF, and PEFT (LoRA, adapters).
- Architect RAG systems and transformer-based solutions using LangChain and SQLChain.
- Measure and benchmark LLM performance and application scalability.
- Drive end-to-end implementation and deployment using Azure and AWS services.
Required Skills
- 10+ years of experience in machine learning and artificial intelligence.
- Expertise in LLMs including OpenAI GPT, Anthropic Claude, Meta Llama2, and Google Gemini.
- Hands-on experience with RAG systems, LLM fine-tuning, and Vector Databases.
- Strong programming skills in Python with a focus on optimization and software design.
- Deep understanding of ML/DL techniques, including CNN, RNN (LSTM), and Transformers (BERT, BART, GPT/T5, Megatron).
- Experience training models using PyTorch and performing NLP data wrangling and tokenization.
- Knowledge of conversational AI, including NLU, NLG, dialog systems, and information retrieval.
- Proficiency with MLOps workflows and platforms such as Kubeflow, MLFlow, or AirFlow.
- Database management experience with SQL and MongoDB.
Preferred Skills
- Experience with project management and guiding teams in a matrix environment.