Back to all roles

Remote Bilingual Italian Generalist Evaluator Expert

Remote-first Full-time Now hiring

Mercor is seeking native Italian speakers from Switzerland or Italy with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. Freelancers will author Italian / English prompt–golden answer pairs that train and evaluate advanced language models. Job Details

  • Multilingual Prompt Design & Optimization: Create detailed prompts in Italian and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Italian-speaking users in Switzerland and Italy contexts.
  • Define and Document Evaluation Standards: Establish high-level expectations for correct responses in Switzerland and Italy consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions specific to these regions.
  • Model Testing and Grading (Bilingual): Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Italian, comparing results against English where needed.
  • Benchmarking & Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor—maintaining consistency and reliability across Italian-language benchmarks before integration into official evaluations.

Minimum Qualifications

  • Native-level fluency in Italian (written), specific to Switzerland or Italy usage, with strong reading/writing ability in English.
  • Must be native to Switzerland or Italy and have lived in or spent significant time in-country, with deep cultural and linguistic familiarity.
  • BS or BA from a reputable institution (completed or in progress).
  • Strong writing and critical thinking skills.
  • Ability to work independently and meet deadlines.
  • Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
  • Based in Switzerland or Italy (or able to reliably produce Switzerland- or Italy-specific, culturally accurate Italian).

Preferred Qualifications

  • Experience in teaching, research, editing, or academic writing.
  • Experience creating evaluation criteria, rubrics, or grading guidelines.
  • Familiarity with LLMs, prompting, or model evaluation (helpful but not required).

Application & Onboarding Process

  • Complete an AI-led interview (about 15 minutes).
  • If approved, complete a paid assessment focused on writing and rubric creation.
  • Then, if selected, you will be invited to work on the project.

More Details About This Role

  • Expect to contribute at least 20 hours per week.
  • Expect a commitment of approximately 2–4 months.
  • You’ll be working in a structured project environment with clear goals and tools.
  • We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.

Apply tot his job Apply To this Job

More remote roles

Bilingual Insurance Evaluator Arabic/English

Remote-first Full-time

STEM Master’s/Ph.D. Research Report Evaluator

Remote-first Full-time

ARMENIAN (WESTERN) TESTING EVALUATOR

Remote-first Full-time

[Remote] Technology Product Owner – Entry Filing for T01/T11

Remote-first Full-time

CRM Product Owner

Remote-first Full-time

Product Manager | 2 openings | MN or Telecommute

Remote-first Full-time

Sr. Technical Product Manager- Microsoft

Remote-first Full-time

Senior Full Stack Developer, Product Owner – Real-Time Intelligent Communication Systems

Remote-first Full-time

Associate Product Owner - Provider Services (Open to hiring at the Product Owner level)

Remote-first Full-time

Principal Technical Product Manager, Application Infrastructure

Remote-first Full-time

Experienced Home-Based Chat Support Representative – Immediate Start, No Experience Required

Remote-first Full-time

Product Owner, Agentic AI

Remote-first Full-time

Esports Game Coach - Marvel Rivals

Remote-first Full-time

Remote Data Entry Specialist – Online Market Research & Customer Support Professional

Remote-first Full-time

Litigation Insurance Adjuster (Remote)

Remote-first Full-time

Experienced Customer Support Representative – Remote Work Opportunity at arenaflex

Remote-first Full-time

Digital Sales Representative - DateCodeGenie devices

Remote-first Full-time

Business Compliance Liaison Director

Remote-first Full-time

Experienced Customer Experience Consultant (Remote) - Raleigh

Remote-first Full-time

Demand Generation Associate

Remote-first Full-time