Description

You will evaluate and compare LLM-generated code quality. You will assign ratings, provide technical explanations, and write code snippets to address specific prompts.

Responsibilities

  • Evaluate two LLM-generated code responses for quality.
  • Assign better or worse ratings to code outputs.
  • Provide technical explanations for your evaluations.
  • Write code snippets to address specific prompts.

Required Skills

  • 2+ years of experience with Python.
  • Working knowledge of Java.
  • Ability to write functional code snippets based on prompts.
  • Experience evaluating code quality and logic.
  • Graduate degree in any field.

Key Skills
Education

Any Gradute