Description
You will design, develop, and deploy AI/ML solutions specializing in Computer Vision and Natural Language Processing.
Responsibilities
- Design and optimize deep learning models for image classification, object detection, and semantic segmentation.
- Build and fine-tune NLP models for text classification, sentiment analysis, and entity recognition.
- Preprocess, augment, and clean large multimodal datasets for vision and text pipelines.
- Deploy models into production environments and collaborate with engineering teams for system integration.
- Experiment with Transformers, Vision-Language models, and foundation models.
Required Skills
- Minimum 5 years of professional experience in machine learning with a focus on CV and NLP.
- Proficiency in Python and frameworks including TensorFlow, PyTorch, and Hugging Face.
- Hands-on expertise with OpenCV, Dlib, and text embeddings such as BERT and GPT.
- Strong knowledge of deep learning architectures including CNNs, RNNs, Transformers, and Vision Transformers (ViT).
- Experience with NLP libraries like spaCy, NLTK, or OpenAI APIs.
- Familiarity with cloud platforms including AWS, GCP, or Azure.
- Experience with MLOps tools for model deployment.
- Proficiency with Pandas, NumPy, and SQL.
Preferred Skills
- Experience in multimodal AI applications combining CV and NLP.
- Knowledge of OCR techniques and frameworks.