✍️ Burrows' Delta Stylometry

Compare two text samples for same-author probability — the standard stylometric algorithm, browser-side.

Implements Burrows' Delta — the standard authorship-attribution algorithm. Computes z-scored function-word frequencies and produces a calibrated same-author probability. Pure offline JS; nothing leaves your browser.

⚠️ Treat the verdict as a lead, not proof. Burrows' Delta is statistically robust for samples of ~5,000+ words but degrades on shorter text. The probability calibration here is fitted to published validation data and should be recalibrated on your own domain corpus before forensic use. For court-grade analysis, use the stylo R package or faststylometry with proper reference corpora.

Free Burrows' Delta authorship comparator

Implements the standard Burrows' Delta algorithm for authorship attribution entirely in your browser. Paste two text samples, get back a delta score, calibrated same-author probability, and a breakdown of the function words that drove the verdict. Pure offline JavaScript — nothing leaves your computer.

For investigators, journalists, and researchers: Burrows' Delta is the workhorse of computational stylometry, used to identify Robert Galbraith as J.K. Rowling, attribute parts of Henry VI to Christopher Marlowe, and verify newly discovered manuscripts. The same algorithm applies to anonymous online posts, threatening letters, and disputed authorship. The 2025–2026 research wave has extended its use to detecting AI-generated text — though as a single signal among many, not as standalone proof.

For research-grade analysis with proper reference corpora and significance testing, see the stylo R package or faststylometry Python library. For forensic stylometry that incorporates additional features (sentence length, punctuation, syntactic patterns), see JStylo.

Frequently asked questions

What is Burrows' Delta?
Burrows' Delta is the gold-standard algorithm for stylometric authorship attribution, introduced by John Burrows in 2002. It compares the relative frequencies of the most frequent words (typically 100–500 function words like the, of, and, to, in) between two texts, normalises each text's frequencies into z-scores against a reference distribution, and reports the mean absolute z-difference. Lower numbers mean more similar styles.
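The computation described above can be sketched as follows. This is a minimal illustration of the standard formula, not the tool's actual code; the helper names are ours, and the per-word reference statistics (mean and standard deviation of relative frequency) are assumed to be supplied from a reference distribution:

```javascript
// Lowercase word tokens from raw text.
function tokenize(text) {
  return text.toLowerCase().match(/[a-z']+/g) || [];
}

// Relative frequency of each word in a token list.
function relFreqs(tokens) {
  const counts = new Map();
  for (const t of tokens) counts.set(t, (counts.get(t) || 0) + 1);
  return new Map([...counts].map(([w, c]) => [w, c / tokens.length]));
}

// Burrows' Delta: mean over marker words of |z_A - z_B|, where each z-score
// is taken against a reference distribution (word -> { mean, sd }).
function burrowsDelta(textA, textB, mfw, ref) {
  const fA = relFreqs(tokenize(textA));
  const fB = relFreqs(tokenize(textB));
  let sum = 0, n = 0;
  for (const w of mfw) {
    const { mean, sd } = ref.get(w) || {};
    if (!sd) continue;                     // skip words with no variance info
    const zA = ((fA.get(w) || 0) - mean) / sd;
    const zB = ((fB.get(w) || 0) - mean) / sd;
    sum += Math.abs(zA - zB);
    n++;
  }
  return n ? sum / n : 0;
}
```

Identical texts score 0; the more the two samples' function-word usage diverges from each other (relative to the reference spread), the higher the Delta.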
How do I read the score?
The Δ value is on a roughly continuous scale: below ~0.20 typically indicates the same author with high confidence, around 0.35 is the boundary region, and above ~0.50 typically indicates different authors. The probability shown above the score comes from a logistic function calibrated to that range — treat it as a guide, not a definitive answer.
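A logistic calibration of this kind can be sketched as below. The midpoint (0.35, the boundary region) and slope are illustrative placeholders, not the tool's fitted parameters:

```javascript
// Hypothetical calibration: map a delta score to a same-author probability
// via a logistic curve. mid and k are illustrative, not the fitted values.
function sameAuthorProbability(delta, mid = 0.35, k = 15) {
  // Lower delta means more similar styles, so probability falls as delta rises.
  return 1 / (1 + Math.exp(k * (delta - mid)));
}
```

At the midpoint the probability is exactly 0.5; well below it (e.g. Δ ≈ 0.1) the curve saturates toward "same author", and well above it toward "different authors".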
How long do my samples need to be?
The minimum here is 100 words per sample, but reliable results need much more — Burrows' original work used novel-length samples (50,000+ words). For forensic-style attribution, 5,000+ words per side is a reasonable floor. Below that, function-word distributions are too sparse to be stable. The tool will run on shorter input but will warn you in the explanation.
Can it tell human-written from AI-written text?
Yes, in principle. Recent research applies stylometry to detect LLM output: GPT-class models have characteristic function-word distributions that differ from human writers. The catch: light human editing or deliberate paraphrasing significantly degrades detection. For a single signal among many, this works; as standalone proof of AI authorship, it does not. Use alongside dedicated AI-text detectors and content-provenance tools.
Why are some common words missing from the analysis?
The MFW (most frequent words) are taken from the combined corpus of both samples. If a word is rare in both, it can't be in the MFW. The "Top 20 contributors" panel shows which words drove the score the most — these tend to be common function words whose frequencies diverge between the two writers, exactly what stylometry is designed to detect.
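Building the MFW list from the combined corpus can be sketched like this (the function name and word count are illustrative, not the tool's API):

```javascript
// Sketch: take the n most frequent words across both samples combined.
// Words rare in both texts never make the cut, as described above.
function topWords(textA, textB, n = 150) {
  const tokens = (textA + " " + textB).toLowerCase().match(/[a-z']+/g) || [];
  const counts = new Map();
  for (const t of tokens) counts.set(t, (counts.get(t) || 0) + 1);
  return [...counts.entries()]
    .sort((x, y) => y[1] - x[1])   // most frequent first
    .slice(0, n)
    .map(([w]) => w);
}
```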
How do I defeat stylometric attribution if I'm being doxed?
Manual style obfuscation is largely effective — research shows that purposeful style change, or deliberate imitation of another known author, defeats most stylometric methods. Machine-translation round-tripping, contrary to folklore, works less well: translators preserve enough authorial style to leak through. The defensive guidance from the Whonix project is the canonical reference.
Is the algorithm here the same as in academic stylometry packages?
The core delta calculation matches the standard formula. Two simplifications versus a full implementation: (1) the reference distribution is taken from the average of the two input texts rather than an external corpus, which is the standard approximation when only two texts are being compared; (2) the MFW is built from the combined corpus rather than a separate training corpus. For research-grade work, use stylo (R) or faststylometry (Python) with proper reference corpora.
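One way simplification (1) can work is sketched below — a hypothetical approach, not the tool's actual code, in which the per-word mean comes from the two inputs and the spread is estimated from fixed-size chunks of both texts (the chunking strategy and names are assumptions):

```javascript
// Sketch: estimate per-word reference stats (mean, sd of relative frequency)
// from the two input texts alone, using fixed-size chunks to get a spread.
function referenceStats(tokensA, tokensB, mfw, chunkSize = 500) {
  const chunks = [];
  for (const tokens of [tokensA, tokensB]) {
    for (let i = 0; i < tokens.length; i += chunkSize) {
      chunks.push(tokens.slice(i, i + chunkSize));
    }
  }
  const stats = new Map();
  for (const w of mfw) {
    const freqs = chunks.map(c => c.filter(t => t === w).length / c.length);
    const mean = freqs.reduce((s, f) => s + f, 0) / freqs.length;
    const sd = Math.sqrt(
      freqs.reduce((s, f) => s + (f - mean) ** 2, 0) / freqs.length
    );
    stats.set(w, { mean, sd });
  }
  return stats;
}
```

With an external reference corpus instead, the same statistics would be computed over many documents by many authors — which is why research-grade packages give more stable z-scores.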