Description

You will design, develop, and optimize AI solutions focusing on RAG models and large language model integration.

Responsibilities

  • Design and optimize RAG models using Gemini AI and other LLM frameworks.
  • Implement and manage vector databases to improve retrieval efficiency.
  • Develop AI/ML solutions using Python, TensorFlow, and deep learning frameworks.
  • Integrate AI solutions with GCP services including BigQuery and Airflow.
  • Execute cost optimization strategies for AI workloads on Google Cloud Platform.

Required Skills

  • 5+ years of experience in AI/ML development.
  • Expertise in RAG models and Gemini AI.
  • Proficiency in Python and SQL.
  • Hands-on experience with TensorFlow and PyTorch.
  • Experience with Google Cloud Platform (GCP), including BigQuery, Vertex AI, and AI Platform.
  • Practical knowledge of vector databases such as Pinecone, FAISS, or Weaviate.
  • Understanding of machine learning, deep learning, and NLP concepts.
  • Experience with PostgreSQL and cloud-based tool integration.
  • Competency in infrastructure tools including Docker, Kubernetes, Terraform, and CI/CD pipelines.

Preferred Skills

  • Experience with Large Language Models (LLMs) and generative AI frameworks.
  • Familiarity with MLOps and cloud-based model deployment.
  • Proven track record of optimizing AI workloads on GCP.

Education

Any Graduate