How AI Detection Works - originalaichecker

01 / The Basics

What is AI content detection?

AI content detection is the process of analyzing text to determine whether it was written by a human or generated by a large language model (LLM) such as ChatGPT, Claude, or Gemini. Unlike plagiarism detection — which compares text against a database of known sources — AI detection analyzes the statistical and linguistic properties of the text itself.

The core insight is simple: AI models and humans produce text in systematically different ways. When a language model generates text, it is constantly predicting the next most likely word given everything that came before. The result is output that is grammatically clean, structurally uniform, and statistically predictable. Human writing, by contrast, is messier, more varied, and harder to predict — even when it is technically well-written.

AI detectors exploit this difference by measuring how “AI-like” the statistical patterns in a piece of text are. The two primary signals are perplexity (how predictable the text is) and burstiness (how varied the sentence structure is).

Signal 01

Perplexity

How surprised is a language model when it reads this text? Low perplexity = the text was predictable = likely AI-generated.

Signal 02

Burstiness

How much does sentence length and complexity vary throughout the text? Low burstiness = uniform structure = likely AI-generated.

Signal 03

Model Signatures

Each AI model leaves specific stylistic fingerprints. The detector recognizes patterns specific to GPT, Claude, Gemini, and others.

02 / Core Signal

Perplexity: the predictability signal

Perplexity is a statistical measure borrowed from information theory. In the context of AI detection, it answers the question: how surprised is a language model when it encounters this text?

To understand why this matters, consider how LLMs work. A model like GPT is trained to predict the next word in a sequence. When it generates text, it selects high-probability word sequences — the words that “make sense” given the context. This produces fluent, grammatically correct output, but it also means the text follows predictable patterns.

When a detection model reads AI-generated text, it assigns low perplexity scores — meaning the text is highly predictable. When it reads human writing, it assigns higher perplexity scores, because humans make more unexpected word choices, introduce tangents, use colloquialisms, and generally deviate from the most statistically likely phrasing.

Perplexity in practice

Here’s how perplexity scores distribute across different types of writing. Lower scores indicate higher predictability (more likely AI-generated).

Perplexity distribution by writing type (illustrative)

Human (casual)

High

Human (academic)

Med-High

ESL human writing

Medium

Edited AI text

Medium

Raw AI output

Low

Raw AI output (Gemini)

Very Low

Notice that ESL (English as a Second Language) writing and edited AI text overlap in the medium range — this is one of the core reasons AI detectors produce false positives on non-native speakers. The uniformity that characterizes their writing style is statistically similar to lightly edited AI output.

Key takeaway: Perplexity alone is not enough for reliable detection. A text can have low perplexity because it’s AI-generated, but also because the writer is a non-native speaker or is working in a highly formal register. That’s why burstiness and model signatures are combined with perplexity to produce a final score.

03 / Core Signal

Burstiness: the rhythm signal

Burstiness measures how much variation exists in sentence length and complexity throughout a text. The concept is borrowed from linguistics and network science, where “bursty” phenomena are those that cluster and vary rather than distributing uniformly.

Human writing is naturally bursty. In a single paragraph, a human writer might produce a long, complex, subordinate-clause-heavy sentence, followed by a blunt three-word fragment. The rhythm shifts. Emphasis is created by contrast. This natural variation is hard for AI to replicate — at least without specific prompting.

AI models default to producing text with consistent sentence structure and length. The variation in a 500-word GPT response tends to fall within a narrow range. Sentences are grammatically complete, similarly structured, and rarely burst into fragments or unexpectedly complex run-ons.

Why burstiness catches AI that perplexity misses

Some AI prompting techniques specifically address perplexity. A user can instruct an AI to “write in a more casual, varied style” and reduce perplexity significantly. But sentence-level burstiness is harder to prompt away — the model still tends to produce sentences within a consistent structural range, even when the vocabulary becomes more informal.

This makes burstiness a complementary signal to perplexity. When both measures are low, detection confidence increases significantly.

Human writing

“I’ve been staring at this dataset for three hours. Something’s off — the Q3 numbers don’t match what marketing reported, and frankly I’m not sure anyone checked. The variance across regions is also strange: EMEA is up 40%, but APAC dropped. Why? No one seems to know.”

High burstiness: sentence lengths vary widely (3 words → 22 words). Fragments used for emphasis. First-person, informal, emotionally present.

AI writing

“Upon reviewing the dataset, several notable discrepancies were identified across the Q3 reporting period. The figures provided by the marketing department do not align with the recorded values. Regional performance also showed significant variance, with EMEA demonstrating a 40% increase while APAC experienced a decline.”

Low burstiness: all sentences fall in a similar length range. Passive constructions throughout. No emotional register or structural contrast.

04 / Advanced Detection

Model signatures: identifying which AI wrote it

Beyond perplexity and burstiness, different AI models leave distinct stylistic fingerprints in the text they generate. These signatures emerge from differences in training data, reinforcement learning from human feedback (RLHF), and the specific objectives each model is optimized for.

Detection engines trained against reference outputs from multiple models can identify these patterns and flag which model is most likely responsible for the text — not just whether AI was involved at all.

GPT

OpenAI

Highly structured output, strong use of transitional phrases (“Furthermore,” “In conclusion”), tendency toward listicle-style paragraphs. Among the most detectable models.

Claude

Anthropic

More conversational than GPT models, uses hedging language frequently, tends toward longer paragraphs with nuanced qualifications. Slightly harder to detect than GPT.

Gemini

Google

More varied in structure than older GPT models, but still shows consistent sentence rhythms. Tends toward factual framing and lower emotional register.

Mistral / Llama

Open-source

Smaller models often show cruder transitions and more repetitive phrasing. Harder to detect after fine-tuning, which can shift stylistic patterns significantly.

Model signatures are probabilistic, not deterministic. As models are updated and improved, their fingerprints shift — which is why detection engines need to be continuously updated against new model versions and release families.

Important caveat: Model identification is the least reliable component of AI detection. While the detection engine includes reference signatures for major models, confidence in model attribution is always lower than confidence in the overall AI probability score. Model identification should be treated as indicative, not definitive.

05 / The Engine

How the final score is calculated

The AI probability score returned by this tool is a composite of all three signals — perplexity, burstiness, and model signature matching — combined using a weighted classifier trained on a labeled dataset of human-written and AI-generated text.

01

Text tokenization

The submitted text is broken into tokens (roughly corresponding to words and sub-words). Sentence boundaries are identified for burstiness analysis.

02

Perplexity scoring

The text is run through a language model to calculate token-level perplexity scores. These are aggregated and compared to baseline distributions for AI and human writing.

03

Burstiness calculation

Sentence lengths and syntactic complexity scores are computed. The variance across the document is measured and compared to expected distributions for human vs AI authorship.

04

Model signature matching

The text’s stylistic profile is compared against reference signatures for GPT, Claude, Gemini, Mistral, and Llama-family models. The closest match is identified.

05

Score aggregation

All signals are combined using a trained classifier. The output is a probability score from 0–100% representing the likelihood the text was AI-generated. Sentence-level scores are calculated for the detailed breakdown.

Score interpretation: 0–39% = Likely human-written · 40–69% = Mixed or uncertain · 70–100% = Likely AI-generated. Scores in the 40–69% range are the least reliable — they may indicate edited AI text, ESL writing, or genuinely mixed-authorship documents.

06 / Honest Assessment

Known limitations you should understand

No AI detector is accurate 100% of the time. Understanding where these tools fail helps you use them responsibly — particularly in high-stakes contexts like academic integrity or employment screening.

Limitation	Why it happens	What to do
ESL false positives	Non-native English writing is grammatically uniform and low-burstiness — overlapping with AI patterns	Treat elevated scores with extra caution; supplement with manual review
Edited AI text scores low	Human editing introduces burstiness and unpredictability that erodes AI fingerprints	Look for unedited passages; heavily rewritten AI is effectively human for detection purposes
Short text unreliable	Under ~150 words, there isn’t enough statistical signal to produce a meaningful score	Always analyze 250+ words for reliable results; single paragraphs are insufficient
Formal human writing flags as AI	Legal, medical, or highly formal academic writing can have low perplexity and burstiness	Apply contextual judgment; domain matters when interpreting scores
New models delay detection	Detection engines are trained on existing model outputs; new model releases create a lag period	Detection engines are updated regularly as new models are released
Anti-detection paraphrasing	Tools designed to “humanize” AI text specifically target detection fingerprints	Humanized text may score lower; no detector fully catches adversarially modified output

Do not use AI detection scores as sole evidence in disciplinary or legal proceedings. These tools are probabilistic, not forensic. Scores should be treated as one data point among many — not as a definitive determination of authorship.

07 / FAQ

Common questions

No. Perplexity is a statistical measure of predictability, not a quality assessment. A text can be grammatically perfect and stylistically excellent while still having high perplexity — meaning it contains unexpected word choices that a language model would not have predicted. Similarly, text can have low perplexity (be highly predictable) while being grammatically flawed. Perplexity is about statistical patterns, not correctness.

It is possible to reduce the AI probability score through significant editing or by using paraphrasing tools. However, substantial human rewriting is effectively equivalent to human authorship for the purposes of detection — by the time you’ve fully rewritten a passage in your own voice, the text is largely yours. Automated “humanizer” tools that target detection fingerprints can reduce scores, but detectors are updated to account for common humanization patterns.

Different detectors use different underlying models, training datasets, and signal weighting. A text might score 45% on one detector and 72% on another because they’re measuring different things, or weighting perplexity and burstiness differently, or using different reference distributions. This is a known limitation of the field — AI detection is not a standardized or regulated technology, and there is no universal ground truth for borderline cases.

This tool is optimized for English-language text. Detection accuracy decreases significantly for non-English text because the underlying models and reference datasets are English-centric. Some multilingual detection capabilities exist, but they are less reliable than English detection and should be treated with additional caution.

Detection accuracy has improved significantly since the first detectors appeared in 2022. However, there is a fundamental arms-race dynamic: as detection improves, AI models are updated in ways that make their output harder to detect, and vice versa. The gap between AI capability and detection capability is unlikely to close entirely. This is why detection should always be used alongside human judgment rather than as a standalone determination.

How AI Detection
Actually Works

What is AI content detection?

Perplexity: the predictability signal

Perplexity in practice

Burstiness: the rhythm signal

Why burstiness catches AI that perplexity misses

Model signatures: identifying which AI wrote it

How the final score is calculated

Known limitations you should understand

Common questions

Put the theory to the test

How AI DetectionActually Works

What is AI content detection?

Perplexity: the predictability signal

Perplexity in practice

Burstiness: the rhythm signal

Why burstiness catches AI that perplexity misses

Model signatures: identifying which AI wrote it

How the final score is calculated

Known limitations you should understand

Common questions

Put the theory to the test

How AI Detection
Actually Works