Robotics Evaluation Specialist

Noida, Indien

Experience: 4+ years
Job Type: Full-time (Contractual / Freelancing)
Location: Remote
Required Skills: Strong attention to detail, Clear written communication, Robotics, English proficiency

Job Summary:

In this role, you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input.

Key Responsibilities:

  • Carefully follow detailed task instructions to execute physical actions on camera for robotics evaluation.
  • Review and analyze system responses to your interactions, ensuring accurate performance assessments.
  • Provide clear, structured written feedback on the accuracy of robotic system evaluations.
  • Document observations and potential edge cases in a consistent, organized manner.
  • Collaborate asynchronously with the customer’s team to refine evaluation protocols and share insights.
  • Maintain a high standard of data integrity and confidentiality throughout all evaluation activities.
  • Contribute to the continuous improvement of robotics evaluation pipelines based on real-world findings.

Required Skills and Qualifications:

  • Proven experience with robotics pipelines or related hands-on robotics projects.
  • Exceptional attention to detail and the ability to notice subtle differences in task outcomes.
  • Clear, concise written and verbal communication skills in English; native or near-native preferred.
  • Ability to follow complex instructions precisely and reliably without deviation.
  • Strong generalist mindset, comfortable handling a range of simple physical tasks on camera.
  • Proficiency in documenting and reporting structured feedback.
  • Reliable internet connection and suitable environment for remote video-based tasks.

Preferred Qualifications:

  • Experience with paused or historical robotics evaluation pipelines.
  • Background in robotics engineering, human-computer interaction, or automation testing.
  • Familiarity with providing evaluative feedback or participating in user testing cycles.