You will evaluate and compare LLM-generated code quality. You will assign ratings, provide technical explanations, and write code snippets to address specific prompts.
Responsibilities
- Evaluate two LLM-generated code responses for quality.
- Assign better or worse ratings to code outputs.
- Provide technical explanations for your evaluations.
- Write code snippets to address specific prompts.
Required Skills
- 2+ years of experience with Python.
- Working knowledge of Java.
- Ability to write functional code snippets based on prompts.
- Experience evaluating code quality and logic.
- Graduate degree in any field.