AI Data Labeling Exploitation: Who Really Makes AI "Safe"

It's hour four of a nine-hour shift in Nairobi. A young man we'll call James has read about two hundred passages of text today, most describing things you would not want described to you over dinner: torture, child abuse, the precise mechanics of a suicide. His job is to tag each one so a chatbot eight thousand miles away will learn to refuse to produce them. He's paid roughly two dollars an hour. There's a counselor he can talk to, in theory, if he can find the time between quotas.

James doesn't appear in any keynote or marketing deck. When the company that hired his employer's employer talks about its "safe" and "responsible" AI, it doesn't mention him. And yet without James — and a few million people like him — the model you used this morning would be a useless, dangerous mess.

We talk endlessly about GPUs, parameters, and architectures. We almost never talk about the people who hand-feed these systems the difference between pedestrian and mailbox, helpful and toxic. So let's talk about them — and about why squeezing them for pennies doesn't just hurt them. It quietly wrecks the product in your pocket. Think of a frontier AI model as a gleaming tower: everyone admires the penthouse. Almost nobody asks who poured the foundation — or what happens to the whole building when that foundation is mixed cheap.

What Is Data Labeling and Why AI Can't Exist Without It

Here's the uncomfortable secret under the hype: modern AI does not learn "straight from the internet." It learns from examples that humans have carefully marked up. So what is data annotation — and what is AI data labeling? Same question, same answer: the work of attaching meaning to raw data — pointing at a picture and saying this is a pedestrian, reading a sentence and saying this is hate speech, comparing two chatbot replies and saying this one's better. It's the floor everything else stands on.

This is also a real and enormous job market. Search "data annotation jobs" and you'll find tens of thousands of listings; AI training jobs are now a genuine global category, from full-time annotators to gig taskers clicking micro-assignments at midnight. The work breaks into a few layers, each harder and more consequential than the last.

Image Annotation: Teaching Models to See

The simplest layer is image annotation. A human looks at a photo or video frame and draws boxes: here's a car, here's a sign, here's a child, here's a trash can. Image annotation AI systems — self-driving cars, medical-scan readers, satellite analysis, security cameras — need millions of these labels before they stop confusing a toddler with a fire hydrant.

It sounds trivial. It isn't. Trace the wrong outline ten thousand times and you've taught a two-ton vehicle a subtly wrong idea of what a person looks like. The boring work is the load-bearing work.

Content Moderation: The Human Filter Behind AI Safety

One floor up is AI content moderation, where the job stops being tedious and turns hazardous. Someone has to look at the worst material the internet produces — graphic violence, sexual abuse, the genuinely unspeakable — and label it off-limits. Only after a human has seen it can the model learn not to show it to you.

That's the grim bargain at the heart of AI safety: a person absorbs the horror first so the machine can deflect it later. The human filter is a chair, a screen, and a content moderator's mental health quietly eroding in real time.

RLHF (Reinforcement Learning from Human Feedback) Explained

The most delicate layer is RLHF. So what is RLHF? Reinforcement learning from human feedback is the process that turns a rambling text-predictor into something that feels like a helpful assistant. A person is shown two of the model's answers to the same prompt and picks the better one. Do that a few million times and the model learns to prefer clear over muddled, polite over rude, true over invented.

Picture the actual task. The prompt: "Explain why the sky is blue to a six-year-old." Answer A is accurate but reads like a physics textbook. Answer B is charming but fudges the science. Which is "better"? The honest answer is it depends — and a tired reviewer racing a quota has maybe half a second to decide. That single ambiguous judgment, multiplied across millions of comparisons, sculpts the model's entire sense of a good answer. RLHF is the human in the loop AI that everyone praises and nobody pays well.

Behind that half-second click usually sits a 40-plus-page guideline document defining exactly what "better" means. Reading it carefully takes longer than the task pays for. Guess which one wins.

How Big Is the Data Labeling Industry? (Scale AI, Appen, Market Size)

This is no cottage industry. Depending on how you count, the data labeling market size sat around $18 billion in 2024 for solutions and services, compounding at a brutal clip as every company tries to "do AI."

The data labeling companies behind it are household names in the trade. Scale AI built a contractor network across Kenya, the Philippines, and Venezuela; Australia's Appen claims more than a million contractors speaking 235+ languages across 170 countries. Scale AI's 2024 revenue ran to roughly $870 million; a May 2024 round valued it near $13.8 billion, and in June 2025 Meta paid about $14.3 billion for a 49% stake, valuing it at $29 billion. Data work isn't a side market. It's the bedrock.

Why Cheap Data Labeling Means a Worse AI Product

Now the thesis, plainly: a model is only ever as good as the labels it learns from. And labels are only as good as the conditions of the human producing them.

The industry has a quiet quality metric called inter-annotator agreement — do several labelers independently reach the same verdict? Low pay destroys it, because exhausted, rushed people guess. One mislabeled toxicity category and the model waves through a whole class of harmful requests. Cutting costs on labeling isn't trimming maintenance. It's pouring cheap concrete into the foundation and hoping the penthouse doesn't notice. Companies cut where labor law is weakest. The first testing ground was Kenya.

Kenya: The Hidden Cost of AI Content Moderation

Sama, OpenAI, and the "Ethical AI" Promise

In 2019, Meta opened its first content-moderation hub in sub-Saharan Africa, in Nairobi. The contractor running it was Sama, formerly Samasource — headquartered in San Francisco but operating mostly in East Africa, and marketing itself as an "ethical AI" company lifting people out of poverty through dignified tech work. Its client roster reportedly touched a quarter of the Fortune 50.

When OpenAI signed on in 2021, Samasource's Kenyan workers got handed the job nobody else wanted: labeling the worst of the internet so ChatGPT could learn to refuse it. The "ethical AI" of the press release and the production line in Nairobi were separated by a gulf.

How Much Are Kenyan AI Moderators Paid?

A Time investigation by Billy Perrigo did the arithmetic the contracts obscured. OpenAI paid Sama around $12.50 an hour per worker. The workers themselves took home between $1.32 and $2 an hour. The spread between those two numbers isn't a rounding error. It's the business model.

Economists have a name for that gap: labor arbitrage — routing work to wherever it's cheapest. The market shape has a name too: monopsony, where a handful of buyers set the price for a vast pool of sellers who have nowhere else to go. The workers read 150 to 250 grim passages per shift. The savings flowed north.

Psychological Trauma and the Daniel Motaung Lawsuit

You cannot stare into the internet's basement for eight hours a day at $2 an hour and walk away intact. The content moderation trauma here is documented and severe: PTSD, insomnia, broken relationships. Content moderator mental health was, by multiple accounts, an afterthought — wellness support that was thin, generic, and hard to reach.

Daniel Motaung, a former moderator, says he developed PTSD on the job and was fired after trying to organize his colleagues. His lawsuit against Meta and Sama became the opening case in a wave of litigation — a test of whether the harm done to these workers counts as a cost the companies must carry.

Blacklisting, Layoffs, and the Majorel Contract

In early 2023, Sama announced it was exiting content moderation to focus on computer vision; 260 moderators got layoff notices. The Meta contract moved to Luxembourg-based Majorel. Former Sama staff applied en masse. Not one, they say, was called for an interview.

A lawsuit filed in March 2023 by 184 moderators (the number later grew) alleged Majorel's recruiters were explicitly told not to hire anyone from Sama — that the workers had been blacklisted for trying to unionize. It's how the system treats organizing: not with a fight, but with a quiet door that never opens.

A Legal First: Can Meta Be Sued Where It Has No Office?

Then came the precedent that should keep corporate lawyers up at night. Meta tried to get Motaung's case tossed, arguing Kenya had no jurisdiction because the company isn't registered or trading there. In 2023 the Employment and Labour Relations Court disagreed, ruling Meta could be named as a defendant. In September 2024, Kenya's Court of Appeal upheld that, clearing 185 former moderators to take Meta to trial.

This is the first ruling of its kind anywhere in the world. It potentially cracks the whole parent-company → contractor → subcontractor shield that global outsourcing is built on — the layers that were supposed to make the company at the top untouchable.

Africa's First Content Moderators' Union

In May 2023, more than 150 workers labeling content for Facebook, TikTok, and ChatGPT voted to form Africa's first content moderators' union, backed by the Communications Workers Union of Kenya. For some of the lowest-paid jobs in global tech, it was the first time the people doing them had a collective voice instead of a non-renewed contract.

How Worker Burnout Degrades Labeling Quality

Here's the part the spreadsheets miss. When a person spends eight hours absorbing trauma for $2 an hour, by hour four they've stopped being a careful labeler. An AI trained by exhausted people for pennies is, by construction, worse than one trained by rested specialists. The cruelty and the quality problem are one and the same.

The Philippines: Inside the AI Microtask Economy

Scale AI, Remotasks, and the SEPI Subsidiary

If Kenya is a story about the human psyche, the Philippines is a story about arithmetic. In 2019, Scale AI set up a local subsidiary — Smart Ecosystem Philippines Inc., or SEPI — to run its Remotasks platform. Offices opened in Cagayan de Oro; thousands worked from home and from internet cafés. Estimates of the total Philippine workforce range from 10,000 to two million — nobody knows the real figure, including the government.

How the Microtask Payment Model Works

The model is brutally simple. You open the platform and see a queue: trace the car in this clip, transcribe this audio, pick the better of two chatbot answers. Each task is priced individually — fractions of a cent, sometimes a few cents, occasionally a dollar. No employment relationship: you're a contractor, the platform pays per task, and everything else — your hours, your health, your bad week — is your problem.

This is the gig economy in its purest form: gig economy workers reduced to a queue of piecework, classified as independent contractors precisely so no one owes them a minimum wage, a contract, or a sick day. It's a near-perfect engine for gig economy exploitation — the worker has no leverage and no idea who they're working for.

The Race to the Bottom: How Per-Click Rates Collapsed

In 2020–2021, a sharp tasker could clear $150–$200 a week — well above the local minimum of $6–$10 a day. People quit office jobs. Then Scale AI expanded the platform into India and Venezuela, and rates fell off a cliff. According to former workers, pay for identical tasks dropped from around $10 to less than a cent. Not by ten percent. Not by half. By a factor of a thousand. This is the race to the bottom, operating exactly as advertised: the more countries in the pool, the cheaper every worker becomes. Venezuela's collapse supplied a desperate labor force, and Filipino taskers found themselves in the same auction — and lost.

Fair Work Failures and the Washington Post Investigation

The receipts piled up. In 2022, the Oxford Internet Institute's Fairwork project scored Scale/Remotasks a 1 out of 10 on basic fairness. In August 2023, the Washington Post investigation by Rebecca Tan and Regine Cabato put numbers to the Scale AI controversy: payments delayed for months, slashed without explanation, sometimes withheld — one worker paid 30 cents for four hours.

The corporate reply was boilerplate: delays are "extremely rare," systems "continuously improving." The Philippine government's reply was a shrug — a communications secretary calling data labeling an "informal sector" they didn't know how to regulate. Then in March 2024, Remotasks abruptly cut off Kenya, Nigeria, and Pakistan, the email arriving hours before the shutdown, no severance. It's far easier to leave than to be regulated.

Why Half-a-Cent Tasks Produce Low-Quality Training Data

Run the math the way a worker has to. At half a cent per label and a minute per task, you're earning thirty cents an hour. To clear even the local minimum you'd need to fire off four labels a minute. At that speed you don't read the 40-page guideline, you don't check the ambiguous cases — you click something plausible and move on. It isn't laziness; it's survival arithmetic. And that data flows straight into the training sets behind GPT, Claude, and Gemini. The intelligence of a trillion-dollar model rests partly on how much attention a person earning thirty cents an hour could afford to pay.

How Bad Data Labeling Affects the AI Models You Use

The Kenyan and Philippine stories look different — trauma versus tedium, $2 an hour versus half a cent a click — but the blueprint is identical. An American company hires a contractor, the contractor hires workers in a country with weak labor law, and two or three legal layers sit between the brand and the human. The structure isn't a bug. It's the product. Here's how that cheap foundation cracks the building you actually live in.

AI Safety: The Jailbreak Arms Race and Filter Gaps

When ChatGPT politely declines to explain how to build a weapon, that isn't magic. It's Kenyan moderators who labeled the bad stuff so the model could learn the boundary. Label that boundary thinly — by traumatized people, in a hurry, for $2 an hour — and it develops holes.

The ongoing AI jailbreak arms race, where users find ways around guardrails faster than companies can patch them, is partly a direct consequence of underfunded labeling. Every gap in the filter traces back to a gap in the foundation.

Hallucinations: When Rushed RLHF Produces Confident Lies

AI hallucinations — the model stating fiction with total confidence, inventing citations, mangling dates — often start at the RLHF stage. If a reviewer picks the "better" answer in half a second to hit quota, that choice is close to random — and millions of semi-random choices become the model's reward function.

Concretely: a rushed labeler rewards a fluent-but-wrong answer over a clunky-but-correct one. The reward model learns confidence reads as quality, and the finished model delivers polished nonsense with a straight face. One careless click doesn't stay one careless click. It compounds into a behavior.

Algorithmic Bias and Underserved Languages

Algorithmic bias has a labor explanation too. Models work better in English because labeling English pays better and attracts more labelers, so the data is deeper and cleaner. Languages like Amharic, Malagasy, or Tagalog sit below the profitability line and get thinner, noisier labeling — or none. The bias isn't only in the algorithm; it's in the budget that decided whose language was worth annotating well.

The Alignment Problem as a Labor Problem

Here's the line the industry doesn't put on the slide. The AI alignment problem — the grand challenge of making models reliably do what we want — is usually framed as math and philosophy. But strip away the abstraction and a large chunk of it is a labor problem.

A company that cuts costs to the bone on the humans who define safe, helpful, and true cannot, by construction, produce a reliably safe, helpful, truthful model. The mismatch between values and behavior often begins not in the weights, but in the wage.

What's Changing in AI Labor Regulation

Court Precedents and the EU AI Act on Training Data Transparency

A few things are genuinely shifting. Kenya's courts ruled a US tech giant can be sued where it has no office — a precedent that quietly threatens the entire outsourcing shield. Africa's first content moderators' union exists now. Journalists at Time, the Washington Post, and the Guardian keep pulling the curtain back.

And regulation is inching in. The EU AI Act, in force since August 2024, now requires providers of general-purpose AI to publish a detailed summary of their training data, with a mandatory disclosure template rolling out from 2025. It targets copyright and provenance today, but transparency about what's in the data is one short step from transparency about who labeled it and how they were treated. Researchers are pushing a parallel idea — "datasheets for datasets" and supply-chain disclosure, the AI equivalent of a nutrition label.

What Isn't Changing: The Logic of the Race to the Bottom

Now the cold part. As long as cheaper jurisdictions exist, the work drifts toward them — exactly as Remotasks drifted from unionizing Kenya to crisis-stricken Venezuela. Outsourcing presupposes mobility: a structural slide toward wherever the rules are weakest and the desperation deepest. The trauma and the bad labels are externalities — costs shoved onto workers and users, kept off the company's books.

This is the lineage Mary Gray and Siddharth Suri named ghost work in their 2019 book — human labor hidden behind the curtain of "automation," running back to Amazon's Mechanical Turk and, further still, to an old colonial pattern: value extracted from the Global South for products consumed comfortably in the North. The marketing hides behind warm words — safe, ethical, aligned, responsible — but pull the chain to its end and you find a person labeling abuse for $2 an hour, or clicking microtasks for half a cent.

Will synthetic data and AI-assisted labeling fix this? Maybe partly. But letting the model help label its own training data mostly risks burying the human deeper and laundering the bias, not removing it. So next time someone tells you AI "thinks for itself," remember the floor below the floor. The question was never whether a human stands behind the answer. It's how little they were paid — and how much they had time to notice.

Questions.

What is data labeling in AI?

Data labeling — also called data annotation — is the work of attaching meaning to raw data so a machine can learn from it: boxing objects in images, tagging text as toxic or safe, transcribing audio, or comparing two AI answers to mark the better one. It's the human-generated answer key that supervised machine learning and RLHF both depend on. No labels, no model.

How much do AI data labelers earn?

It varies wildly by region and platform. Kenyan content moderators on the OpenAI project took home $1.32 to $2 an hour while the client was billed around $12.50. On microtask platforms, AI data labeling jobs can pay fractions of a cent per task — workers have reported effective rates as low as thirty cents an hour after rates collapsed. Pay in the Global North runs far higher, which is exactly why the work gets routed elsewhere.

Which companies use overseas data labelers?

Most major AI developers rely on labeling through intermediaries. Scale AI (via Remotasks and its SEPI subsidiary), Appen, and Sama/Samasource are among the best-known data labeling companies, supplying or having supplied OpenAI, Meta, Google, and Microsoft. The layered contractor structure is standard across the industry.

Does cheap data labeling make AI less safe?

Yes — and this is the core of it. Underpaid, overworked labelers produce noisier labels and lower inter-annotator agreement. That degraded data propagates directly into weaker safety filters (easier jailbreaks), more hallucinations from rushed RLHF, and worse performance in under-resourced languages. Is data annotation legit work? Absolutely — it's skilled, essential, and demanding. The problem isn't the work. It's the price.

Sources

References cited in this piece. Last verified on the published or revision date.

01

OpenAI Used Kenyan Workers on Less Than $2 Per Hour

time.com/6247678/openai-chatgpt-kenya-workers
02

Inside Facebook's African Sweatshop

time.com/6147458/facebook-africa-content-moderation-employee-treatment
03

Meta's Attempt to Dodge Major Kenya Court Case Fails

time.com/6253180/meta-kenya-lawsuit-motaung
04

150 African Workers for AI Companies Vote to Unionize

time.com/6275995/chatgpt-facebook-african-workers-union
05

185 Kenya Content Moderators to Trial After Meta Appeal Dismissed

www.foxglove.org.uk/2024/09/23/facebook-content-moderators-kenya-meta-appeal
06

Behind the AI Boom, an Army of Overseas Workers in 'Digital Sweatshops'

www.washingtonpost.com/world/2023/08/28/scale-ai-remotasks-philippines-artificial-intelligence
07

Scale AI's Remotasks Is Dropping Whole Countries Without Explanation

restofworld.org/2024/scale-ai-remotasks-banned-workers
08

Fairwork Cloudwork Rating: Scale AI / Remotasks 2022

fair.work/en/ratings/platforms/cloudwork-scale-remotasks/?y=2022
09

OpenAI and Sama Hired Underpaid Workers in Kenya to Filter Toxic Content for ChatGPT

www.business-humanrights.org/en/latest-news/openai-and-sama-hired-underpaid-workers-in-kenia-to-filter-toxic-content-for-chatgpt
10

Scale AI Revenue and Post-Meta-Deal Reporting

www.cnbc.com/2025/11/04/scale-ais-life-after-meta-has-been-rocky-cfo-insists-not-a-zombie.html
11

Meta Finalizes Investment in Scale AI at $29B Valuation

www.investing.com/news/stock-market-news/meta-finalizes-investment-in-scale-ai-valuing-startup-at-29-billion-4094272
12

EU AI Act — Article 53: Obligations for Providers of General-Purpose AI Models

artificialintelligenceact.eu/article/53
13

Appen — AI Training Data and Global Crowd

www.appen.com/ai-data
14

Data Labeling Solution and Services Market Size and Forecast

www.precedenceresearch.com/data-labeling-solution-and-services-market

AI Data Labeling Exploitation: How Underpaid Workers in Kenya and the Philippines Undermine Model Safety