Back to all roles

Professional Evaluator - Fully Remote | Upto $35/hr Hourly

Remote-first Full-time Now hiring

About The Job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey. Position: AI Model Evaluation Contractor Type: Contract Compensation: $25–$35/hour Commitment: 20 hours/week Role Responsibilities

  • Write realistic prompts reflecting professional and consumer domain-specific guidance.
  • Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness.
  • Identify fabricated claims, incorrect references, or misleading reasoning in model outputs.
  • Score and rank multiple model responses using structured rubrics across dimensions.
  • Provide written justifications with specific evidence for each evaluation.

Qualifications

Must-Have

  • Professional experience applying domain expertise in a practitioner or advisory capacity.
  • Familiarity with industry-specific standards, regulations, or clinical guidelines.
  • Strong written communication and critical reasoning skills.

Application Process (Takes 20–30 mins to complete)

  • Submit your resume to begin.
  • Complete the Model Response Evaluation assessment.

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. Apply tot his job Apply To this Job

More remote roles

Audio Evaluator - Fully Remote | Upto $50/hr Hourly

Remote-first Full-time

Special Investigations Unit, Investigator- Remote

Remote-first Full-time

Healthcare Fraud Investigator - Case Development- Remote

Remote-first Full-time

Client Service Advisor

Remote-first Full-time

(US) Customer Success Manager, Senior Living – Remote, USA

Remote-first Full-time

Logistics Coordinator (Entry Level)

Remote-first Full-time

Coordinator, Talent

Remote-first Full-time

Remote | travel logistics coordinator

Remote-first Full-time

Remote Backend Data Entry Jobs for College Students

Remote-first Full-time

Remote Customer Onboarding Specialist – Tech Services

Remote-first Full-time

Social Worker – Family Safeguarding

Remote-first Full-time

Enterprise Workday Administrator

Remote-first Full-time

Evening Gown & Cocktail Dress Seamstress – Alterations – Yuba City, CA

Remote-first Full-time

Part-Time Remote Cybersecurity Analyst – Network Protection & Incident Response – Data Entry Specialist – $25/hr – Flexible Hours

Remote-first Full-time

Experienced E-Commerce Customer Business Manager – Driving Sales Growth and Category Leadership at arenaflex

Remote-first Full-time

Flexible Remote Part-Time Data Entry Specialist – Product Information Management & Quality Assurance

Remote-first Full-time

Experienced Full Stack Customer Service Representative – Work-From-Home Opportunity with arenaflex

Remote-first Full-time

Experienced Customer Service Representative – Medicare Appeal Process Support

Remote-first Full-time

Clinical Administrative Coordinator - (Remote) | Maximus | Handshake

Remote-first Full-time

Manager, Real Estate

Remote-first Full-time