Ai Trainer
Job Summary:
Join our customer’s team as an Ai Trainer and help shape the future of coding AI. You will collaborate with expert programmers and LLMs to curate high-quality, multi-turn programming conversations that power next-generation models. If you are passionate about programming, clear communication, and AI-driven innovation, this role offers a unique chance to make your mark in a rapidly evolving industry.
Key Responsibilities:
- Develop multi-turn Cursor-style coding conversations (10+ turns) with optimal, expert-crafted responses at each step.
- Design and deliver golden responses for each interaction, including code reviews, refactorings, and in-depth explanations.
- Create comprehensive evaluation rubrics (~30 items) for the most complex conversation turns, referencing both prior and subsequent context.
- Document complete prompt chains, including code snippets, user prompts, and expert responses.
- Annotate conversations with thorough metadata, covering domain, language, complexity, and rubric-assigned turns.
- Collaborate asynchronously and communicate findings and feedback clearly and concisely to team members.
- Continuously ensure the quality, coherence, and pedagogical value of all conversation content.
Required Skills and Qualifications:
- Proficiency in Python and JavaScript, with a solid grasp of best practices in both languages.
- Exceptional written and verbal communication skills, with meticulous attention to clarity and detail.
- Proven experience producing code explanations, reviews, and documentation for technical audiences.
- Ability to reason through and construct multi-step coding dialogues, referencing evolving conversation history.
- Strong analytical skills to evaluate and synthesize complex programming concepts and interactions.
- Highly self-motivated, reliable, and comfortable working remotely in a part-time, flexible environment.
Preferred Qualifications:
- Background in AI training, prompt engineering, or language model evaluation.
- Experience designing rubrics or assessment criteria for programming tasks or code quality.
- Prior exposure to building or curating datasets for machine learning use cases.