You will design, develop, and optimize AI solutions focusing on RAG models and large language model integration.
Responsibilities
- Design and optimize RAG models using Gemini AI and other LLM frameworks.
- Implement and manage vector databases to improve retrieval efficiency.
- Develop AI/ML solutions using Python, TensorFlow, and deep learning frameworks.
- Integrate AI solutions with GCP services including BigQuery and Airflow.
- Execute cost optimization strategies for AI workloads on Google Cloud Platform.
Required Skills
- 5+ years of experience in AI/ML development.
- Expertise in RAG models and Gemini AI.
- Proficiency in Python and SQL.
- Hands-on experience with TensorFlow and PyTorch.
- Experience with Google Cloud Platform (GCP), including BigQuery, Vertex AI, and AI Platform.
- Practical knowledge of vector databases such as Pinecone, FAISS, or Weaviate.
- Understanding of machine learning, deep learning, and NLP concepts.
- Experience with PostgreSQL and cloud-based tool integration.
- Competency in infrastructure tools including Docker, Kubernetes, Terraform, and CI/CD pipelines.
Preferred Skills
- Experience with Large Language Models (LLMs) and generative AI frameworks.
- Familiarity with MLOps and cloud-based model deployment.
- Proven track record of optimizing AI workloads on GCP.