Job Description

We are sharing a specialised part-time consulting opportunity for bilingual professionals with strong English and Arabic language skills, analytical judgment, and experience evaluating general-purpose conversational AI systems.

This role supports leading AI teams working to improve the quality, usefulness, and reliability of large language models used across a wide range of everyday and professional scenarios.

Selected professionals will help evaluate model-generated responses, fact-check outputs using trusted public sources, provide structured human feedback, and contribute to improving how advanced AI systems communicate across diverse topics and use cases.

Key Responsibilities

Professionals in this role may contribute to:

AI Response Evaluation
Evaluate LLM-generated responses based on their ability to effectively answer user queries
Assess reasoning quality, clarity, tone, completeness, and overall usefulness of model outputs
Ensure responses align with expected conversational behaviour and system guidelines

Fact-Checking & Feedback
Conduct fact-checking using trusted public sources and external tools
Generate high-quality human evaluation data by annotating response strengths, weaknesses, and factual inaccuracies
Identify reasoning errors, communication gaps, and subtle issues in model outputs

Annotation Quality & Consistency
Apply consistent annotations by following clear taxonomies, benchmarks, and detailed evaluation guidelines
Produce clear, reproducible evaluation artifacts
Help surface issues before public deployment and support measurable improvements in response quality

Ideal Profile

Strong candidates may have:
Bachelor's degree
Native-level Arabic proficiency or ILR 5 / primary fluency, equivalent to CEFR C2
Strong fluency in English
Significant experience using large language models and understanding how and why people use them
Excellent writing skills and ability to clearly articulate nuanced feedback
Strong attention to detail and ability to notice subtle issues others may overlook
Adaptability across topics, domains, and customer requirements
Background in fields requiring structured analytical thinking such as research, policy, analytics, linguistics, or engineering
Excellent college-level mathematics skills

Preferred qualifications

Prior experience with RLHF, model evaluation, or data annotation work
Experience writing or editing high-quality written content
Experience comparing multiple outputs and making fine-grained qualitative judgments
Familiarity with evaluation rubrics, benchmarks, or quality scoring systems
Ability to contribute in both full-time and part-time contract work settings

Why This Opportunity

Work at the frontier of human-in-the-loop AI development
Help shape how advanced language models behave in real-world settings
Contribute to improving AI systems used by millions of people
Flexible remote contract work with competitive compensation

Contract Details

Independent contractor role
Fully remote with flexible scheduling
Compensation of $22.64/hr
Full-time or part-time contract work
Weekly payments via Stripe or Wise
Projects may be extended, shortened, or concluded early depending on project needs and performance
Work will not involve access to confidential or proprietary information from any employer, client, or institution
Please note: We are unable to support H1-B or STEM OPT candidates at this time
Location restricted to Egypt, Saudi Arabia, UAE, or the United States

About the Platform

This opportunity is available through a leading AI-driven work platform that connects domain experts with frontier AI research projects.

Experts contribute to improving advanced AI systems by providing specialised expertise across model evaluation, fact-checking, structured annotation, and conversational quality assessment.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy:

Job Tags

Remote job, Weekly pay, Full time, Contract work, Part time, For contractors, Flexible hours

Similar Jobs

Lane Valente Industries

Traveling Journeyman Electrician Job at Lane Valente Industries

...driven to be highly productive members of the team with an emphasis on integrity and learning. CURRENT OPPORTUNITY: Traveling Journeyman Electrician Responsible for the installation of electrical systems in the commercial construction markets at job sites and...

LP Consulting

Ultimate Experience Travel Coordinator Job at LP Consulting

...Are you passionate about helping others create unforgettable travel experiences? As a Ultimate Experience Travel Coordinator, youll work with clients to plan and book their perfect getaways, whether its an exhilarating cruise adventure or a relaxing resort vacation...

Rep Labs Fulfillment

Warehouse Associate Job at Rep Labs Fulfillment

...company's mission is to support growth by offering streamlined services that optimize order processing, inventory management, and shipping. With a focus on precision and timely delivery, Rep Labs Fulfillment helps eCommerce companies scale confidently with trusted fulfillment...

Centerview Health

Family Medicine Physician Job at Centerview Health

...strong candidates A competitive salary Full coverage of health, dental, and vision insurance. We are seeking a Full-Time or Part-Time Primary Care Physician to join our... ...-centered medical group focused on helping physicians do what they do best: care for...

Northwestern Medical Group

Medical Assistant-Pediatrics Full Time Days Job at Northwestern Medical Group

...talent in the nation leading the way in medical innovation and breakthrough research with... ...located at the Glenview clinic for the Pediatrics specialty. Schedule is Monday through Saturday... ...is preferred.** The Medical Assistant reflects the mission, vision, and values...

Remote | Generalist - English & Arabic — $22.64/hr Job at 24-MAG, San Francisco, CA

UGdwZmU2d1duQndkcUVIN2JhYnpIbk41N3c9PQ==