Educational Technology AI Rater & Evaluator

Remote-first, Full-time, Now hiring

Overview

LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red‑teaming, and ongoing model monitoring. We are seeking education and learning professionals to contribute expert judgment to human‑in‑the‑loop AI evaluation workflows used by leading enterprises and hyperscalers.

This role is designed for professionals who understand how educational content, learning experiences, and instructional systems work in real-world academic and professional learning environments, and who can apply that expertise to evaluate and improve multilingual AI systems. Your expertise will directly influence multilingual AI model quality, safety, and deployment readiness.

This role includes two distinct expert tracks, based on experience level and scope of responsibility.

Track A: EdTech AI Rater

Raters execute structured evaluation tasks using clearly defined rubrics and instructions.

Responsibilities

Evaluate AI outputs related to educational, instructional, and learning content
Perform structured scoring, comparison, classification, and judgment tasks
Assess pedagogical accuracy, clarity, appropriateness, and learning effectiveness
Identify hallucinations, misleading explanations, factual errors, or unsafe educational guidance
Apply domain‑specific education and instructional guidelines consistently across tasks

Ideal Background

Educators, instructional designers, curriculum developers, or learning professionals
Experience with teaching, curriculum design, assessment, or educational technology
Strong attention to detail and comfort working with structured evaluation criteria

Track B: EdTech AI Evaluator (Senior Track)

Evaluators provide higher‑level domain oversight and help shape how evaluation is performed.

Responsibilities

Validate and refine evaluation rubrics and edge‑case handling
Perform adjudication where raters disagree
Conduct error analysis and qualitative reviews of model behavior
Partner with LILT research, product, and customer teams on evaluation design
Support red‑teaming, educational quality review, and model readiness assessments

Ideal Background

Senior educators, academic leaders, learning scientists, or education subject matter experts
Experience defining instructional standards, reviewing complex edge cases, or advising on learning outcomes
Ability to clearly explain nuanced pedagogical reasoning and tradeoffs

Evaluation Focus & Requirements

Types of AI Evaluation Work

Depending on project demands, work may include:

Educational and instructional content evaluation
Learning accuracy and conceptual understanding assessment
Benchmarking and comparative model analysis
Red‑teaming for misleading or harmful educational content
Ongoing model monitoring and regression testing

What We Look For

Deep domain expertise in education, instructional design, or learning sciences
Strong judgment and ability to apply criteria consistently
Comfort working with structured evaluation workflows
Ability to explain reasoning clearly, especially in instructional or learner‑facing scenarios
Reliability, professionalism, and respect for quality standards

Engagement Model

Contract‑based, flexible participation
Project‑based work with clear expectations and timelines
Opportunities for recurring work based on performance and demand
Compensation communicated upfront per project or task type

Why This Work Matters

Your expertise helps ensure that AI systems:

Provide accurate, effective, and responsible educational content
Align with instructional best practices and learning standards
Are trustworthy and supportive for learners across languages

Language Requirements

Native or professional fluency in one or more supported languages is required.
Supported languages span 30+ global languages.
Language‑specific nuance is assessed through screening and task‑based evaluation, not separate job descriptions.
English fluency is required for guidelines, feedback, and collaboration.

AI is changing how the world communicates, and LILT is leading that transformation. LILT's mission is to make the world’s information available to everyone, no matter the language they speak. Join our global community of people who thrive on innovation and excellence. Our collective knowledge, uniqueness, and skills deliver multilingual AI and human‑verified services to enterprises, governments, and AI developers around the world.

Earn money. Have fun. Advance human knowledge. Work on diverse projects from anywhere, any time you want. Get paid quickly and fairly, and build your professional network in a supportive community, all through a streamlined application process tailored to your expertise.

Information collected and processed as part of your application process, including any job applications you choose to submit, is subject to LILT's Privacy Policy at https://lilt.com/legal/privacy.

At LILT, we are committed to a fair, inclusive, and transparent hiring process. As part of our recruitment efforts, we may use artificial intelligence (AI) and automated tools to assist in the evaluation of applications, including résumé screening, assessment scoring, and interview analysis. These tools are designed to support human decision‑making and help us identify qualified candidates efficiently and objectively. All final hiring decisions are made by people. If you have any concerns, require accommodations, or would like to opt out of the use of AI in our hiring process, please let us know at [email protected].

LILT is an equal opportunity employer. We extend equal opportunity to all individuals without regard to an individual’s race, religion, color, national origin, ancestry, sex, sexual orientation, gender identity, age, physical or mental disability, medical condition, genetic characteristics, veteran or marital status, pregnancy, or any other classification protected by applicable local, state, or federal laws. We are committed to the principles of fair employment and the elimination of all discriminatory practices.
