How do AI detectors work? AI detectors work by running text through a statistical model and measuring how predictable it is: machine-written prose tends to be smooth and low-surprise, while human writing is messier and more varied. Detectors quantify this with signals such as perplexity (predictability), burstiness (sentence-length variation), token probabilities and stylometric “fingerprints,” then feed those signals into a classifier that outputs a likelihood score rather than a yes/no verdict.
This guide explains the actual mechanics — what each method measures, a worked example you can follow, how the signals combine, and where the technique breaks down. We focus on the how; for the separate question of how trustworthy the verdict is, see our companion guide on whether AI detectors are accurate.
What Are AI Detectors?
Whenever you read an article or a piece of academic writing today, it is natural to wonder whether it was produced by a human or AI. Generative tools such as ChatGPT, Gemini, Claude and Jasper are now everywhere, drafting essays, research papers, blog posts, emails and even poetry. That convenience created a new problem: telling machine text apart from human text. An AI detector is the software built to answer exactly that question.
AI detectors are tools that estimate whether a passage was written by a person or generated by a language model. Think of them as content “inspectors.” Just as an experienced teacher can sometimes sense when a student did not write their own essay, detectors pick up on statistical patterns that tend to reveal a machine author. They are used across schools, publishing, marketing, journalism and even recruitment:
- Teachers check whether students leaned on AI for assignments.
- Publishers screen submissions for originality.
- Businesses keep brand communication authentic.
- Marketers audit blog posts before they go live.
Crucially, a detector never “reads” for meaning. It does not care whether a sentence is funny, moving or even true. It is a pattern-matching engine, and understanding that single fact is the key to understanding how the whole technology works.
How Do AI Detectors Work? The Core Idea
At their heart, AI detectors work on mathematics and probability, not comprehension. A large language model writes by repeatedly predicting the most likely next chunk of text. Because that process favours the “safe” high-probability choice at each step, the output is unusually smooth and statistically average. A detector exploits this by asking a single question of any passage: does this text look like something a language model would most probably produce?
To answer it, the detector extracts a handful of numerical signals from the text and weighs them up. The five that matter most are perplexity, burstiness, token probability, stylometry and a trained machine-learning classifier that ties them together. We will take each in turn, then show how they combine on a real sample.
1. Perplexity — how “surprising” the text is
Perplexity sounds technical, but in plain terms it measures: how surprised would a language model be to see this exact wording? The detector feeds your text to a reference model and checks how confidently that model would have predicted each word.
- Low perplexity: the wording is highly predictable and flows along the “obvious” path — a hint that a machine wrote it.
- High perplexity: the wording is quirky, unexpected or unusually structured — which reads as more human.
Why the gap? Models gravitate to neat, logical, high-probability phrasing. People tend to add personal asides, odd word choices and emotional texture. For example:
- AI might write: “Climate change is a global problem that requires urgent attention.”
- A human might write: “Honestly, it feels like the summers just keep creeping hotter — and we keep pretending we’ll deal with it next year.”
2. Burstiness — the rhythm of the sentences
Humans rarely write with mechanical consistency. We string together a long, winding sentence packed with detail, then drop a short, punchy one. That natural variation in sentence length and structure is called burstiness. Models, by contrast, often output sentences of similar length and even cadence, all polished to the same finish.
Detectors map the flow of the writing. If every sentence looks cut from the same mould, that is a red flag; if the prose swings between long and short, formal and casual, it reads as more human.
- AI-like: Every sentence is neatly structured. Each one is similar in length. The tone stays level throughout.
- Human-like: One sentence rambles on with examples and a tangent or two, and then the next is three words long. That unevenness feels alive.
3. Token probability — the building blocks
To a computer, text is not made of words but of tokens — whole words, fragments of words or single characters. A model generates text by predicting the next token in a sequence, like an endless game of “finish the sentence.” It does not understand language the way we do; it estimates the most probable next piece.
Detectors zoom in on these token-level probabilities. If the sequence is too regular, too smooth or too mathematically likely at every step, the detector grows suspicious. This is also why genuinely human text written in a plain, formulaic style can be wrongly flagged — it accidentally mimics the model’s low-surprise pattern.
4. Stylometry — the writing “fingerprint”
Some detectors go beyond raw probability and study style. Stylometry is the science of a writer’s fingerprint — the measurable habits that make prose distinctive. It looks at things like:
- Word choice (rare, vivid vocabulary versus generic, on-the-nose phrasing).
- Sentence-length distribution (varied or uniform).
- Grammar quirks (the small rule-bending humans do naturally).
- Repetition and favourite connectors (“moreover,” “in conclusion,” “delve into”).
5. Machine-learning classifiers — AI catching AI
Here is the irony: most modern detectors are themselves AI. They are trained on huge datasets containing both human-written and machine-generated samples, learning the statistical boundary between the two. When you paste in new text, the classifier compares its signals against everything it has seen and outputs a probability.
It is a bit like training a sniffer dog. Once it has learned the difference between the scent of chicken and beef, it can tell you which one is in your bag. The detector “sniffs out” AI writing by matching new text against its training experience — and, just like the dog, it can be fooled by an unfamiliar smell.
| Method | What it measures | AI signal (flags as machine) | Human signal |
|---|---|---|---|
| Perplexity | How predictable the wording is to a reference model | Low — smooth, expected phrasing | High — surprising or quirky phrasing |
| Burstiness | Variation in sentence length and rhythm | Low — uniform sentences | High — mixed long and short |
| Token probability | Likelihood of each next token in sequence | Consistently high-probability tokens | Unexpected token choices |
| Stylometry | Vocabulary, grammar habits, repetition | Generic words, repeated connectors | Distinctive voice and rule-bending |
| ML classifier | Overall fit against trained examples | Close match to AI training samples | Close match to human samples |
Take this AI-style sentence: “Time management is an essential skill that helps students achieve academic success and reduce stress.” Every word is the most expected next word, the sentence length is “average,” and the vocabulary is generic — so perplexity is low, burstiness is low, and a classifier leans toward “AI.”
Now a human revision: “I used to think I was ‘bad at time’ — turns out I just never blocked out the messy 20 minutes after lunch when nothing got done anyway.” The dash, the self-correction, the oddly specific detail and the swing in sentence rhythm push perplexity and burstiness up, so the same detector now leans “human.” Notice the detector never judged whether either sentence was true — only how predictable it was.
How the Signals Combine Into a Score
No single signal decides the verdict. A detector blends them, much like a doctor reading several test results before reaching a diagnosis. Low perplexity on its own might just mean a clear, simple writer; paired with low burstiness, generic vocabulary and a strong classifier match, the combined weight tips toward “AI-generated.”
The output is therefore a probability, not a fact — usually a percentage or a soft label such as “likely AI” or “likely human.” That distinction matters enormously in practice. A score of “82% AI” is an estimate of confidence, not proof, which is exactly why responsible institutions treat it as one signal among many rather than a verdict. In measurement terms, the same passage can score differently across tools and even across runs, so a detector’s reliability — its consistency — is itself limited. How far you can trust that number is a question in its own right, covered fully in our guide on whether AI detectors are accurate.
“Even our own AI writing detection tools are not 100% accurate and should not be used to discipline students.” — OpenAI guidance for educators, on the limits of detection.
Why Detectors Sometimes Get the Mechanics Wrong
Understanding the mechanism explains its blind spots. Because detectors only ever measure predictability and style, anything that disturbs those statistics changes the result — for better or worse:
- Light editing shifts the signals. Reordering clauses or varying sentence length raises burstiness, so lightly edited AI text can read as human to the model.
- Plain human writing looks machine-like. A non-native English writer, or anyone writing in a clear, formulaic style, produces low-perplexity text that detectors may misread — a known source of false positives.
- The reference model drifts. Detectors are tuned to the language models that exist today; as new models change their token patterns, yesterday’s detector becomes less reliable.
- Short texts carry too little signal. A handful of sentences rarely gives enough statistical variation to judge confidently.
Some writers deliberately rework AI drafts — for instance with an AI humaniser tool — specifically to raise perplexity and burstiness. We mention this only to explain why detectors are fallible by design; using such tools to disguise unattributed AI work in assessed coursework breaches most universities’ academic-integrity rules, and the honest path is always to disclose AI use where your institution requires it.
AI Detectors vs Plagiarism Checkers
The two tools are easy to confuse but mechanically very different. An AI detector asks “was this written by a human or a machine?” and works on predictability and style. A plagiarism checker asks “has this been copied from somewhere else?” and works by matching your text against a database of existing sources. One judges how text was produced; the other judges where it came from.
| Dimension | AI Detectors | Plagiarism Checkers |
|---|---|---|
| Main purpose | Estimate whether text was machine-generated | Find text copied from existing sources |
| How they work | Analyse predictability, rhythm and style | Match text against databases for overlap |
| Output | A probability or label (“likely AI”) | A similarity percentage with matched sources highlighted |
| Main weakness | False positives and edited AI text slipping through | Misses clever paraphrasing of sources |
| Best use | Screening for authenticity of authorship | Verifying originality against published work |
A subtle point: plagiarism checkers will happily pass a fully original, machine-written passage because nothing was copied, while AI detectors will flag it. The reverse is also true. That is why some platforms now run both engines side by side. If you are still learning the conventions of original academic writing, our guides on research papers and on planning dissertations are a good grounding, and tools like our own AI detector are best used as a self-check before submission rather than as a gatekeeper.
What the Mechanics Mean for Students and Writers
Once you know that detectors score predictability rather than honesty, three practical lessons follow. First, write in your own voice — genuine variety, specific examples and a real point of view naturally raise the human signals, which is also simply better writing. Second, if you do use AI as a brainstorming or outlining aid, disclose it where your course requires and make the final analysis your own; the goal of any assessment is to demonstrate your understanding. Third, never treat a detector score as a confession. The mechanics guarantee occasional misfires, and the burden of any integrity decision rightly sits with a human reviewer looking at drafts, sources and context.
If you ever find yourself wrongly flagged, the strongest defence is process evidence: version history, notes, outlines and the ability to discuss your argument in person. The detector saw a statistical shadow; your working shows the substance behind it. And if a longer project feels overwhelming, professional support through our dissertation services can help you produce original, defensible work in your own voice from the outset.
Check your writing before you submit
Run a draft through our free AI detector to see how the perplexity and burstiness signals read on your own work — a smart self-check, not a verdict.
For the closely related questions students ask most — how dependable these scores really are, and whether university systems catch AI — see our companion pieces on whether AI detectors are accurate and on whether Turnitin detects AI. Together with this mechanics explainer, they give you the full picture.