Stylometry Analyzer


Compare two writing samples to assess authorship likelihood. Analyzes vocabulary richness, sentence structure, punctuation habits, function word distributions, readability, and 40+ stylometric features. Every computation runs in your browser — no text is transmitted anywhere.

📖 Vocabulary Profile
📐 Sentence Structure
🔤 Function Word Fingerprint

Function words (the, and, but, of, etc.) are the strongest authorship indicators — writers use them unconsciously and consistently across topics.

✏️ Punctuation & Formatting
📊 Readability & Complexity
🔗 Characteristic Phrases

Shared bigrams and trigrams — phrase-level habits that persist across texts by the same author.

🧬 Feature-by-Feature Similarity

Stylometry Analyzer — Forensic Authorship Attribution

Stylometry is the statistical analysis of writing style. Every author develops unconscious linguistic habits — characteristic patterns in vocabulary choice, sentence construction, punctuation use, and function word frequency that persist across topics and time. These patterns form a "writing fingerprint" that is remarkably difficult to disguise and can be used to attribute anonymous texts to known authors.

How It Works

The analyzer extracts 40+ stylometric features from each writing sample and computes a weighted similarity score. Features are grouped into categories: vocabulary richness metrics (type-token ratio, hapax legomena ratio, Yule's K), sentence structure (mean length, variance, complexity markers), function word distribution (the 50 most common English function words), punctuation patterns (comma rate, semicolon preference, exclamation frequency), and readability indices (Flesch-Kincaid, Gunning Fog, Coleman-Liau). Each feature category contributes to the overall similarity score based on its discriminative power in authorship attribution research.
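As an illustration of what per-sample feature extraction can look like, here is a minimal sketch in client-side-style JavaScript. The function name, tokenization regex, and the handful of features shown are assumptions for demonstration, not the tool's actual internals.

```javascript
// Illustrative per-sample feature extraction (a small subset of the 40+
// features; names and tokenization rules are assumptions).
function extractFeatures(text) {
  const words = text.toLowerCase().match(/[a-z']+/g) || [];
  const sentences = text.split(/[.!?]+/).filter(s => s.trim().length > 0);
  const unique = new Set(words);

  return {
    // Vocabulary richness: unique words / total words
    typeTokenRatio: unique.size / words.length,
    // Sentence structure: average words per sentence
    meanSentenceLength: words.length / sentences.length,
    // Punctuation habits, normalized per 1,000 characters
    commaRate: ((text.match(/,/g) || []).length / text.length) * 1000,
    semicolonRate: ((text.match(/;/g) || []).length / text.length) * 1000,
  };
}
```

Each sample yields one such feature object; the two objects are then compared category by category to produce the similarity breakdown.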

Function Words — The Strongest Signal

Research in computational linguistics has consistently shown that function word frequency is the single most reliable indicator of authorship. Function words — articles, prepositions, conjunctions, pronouns, auxiliary verbs — are used unconsciously and at stable rates regardless of topic. An author who uses "however" at twice the average rate will do so whether writing about politics or cooking. This analyzer computes the cosine similarity of function word frequency vectors between the two samples, a technique derived from the landmark Mosteller-Wallace study of the disputed Federalist Papers.
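The cosine comparison described above can be sketched as follows. The word list here is a small illustrative subset of the 50 function words the analyzer tracks, not the actual list.

```javascript
// Cosine similarity of function word frequency vectors (sketch).
// FUNCTION_WORDS is a small illustrative subset, not the analyzer's full list.
const FUNCTION_WORDS = ["the", "and", "but", "of", "to", "in", "a", "that", "however"];

// Relative frequency of each tracked function word in the sample.
function functionWordVector(text) {
  const words = text.toLowerCase().match(/[a-z']+/g) || [];
  return FUNCTION_WORDS.map(
    fw => words.filter(w => w === fw).length / words.length
  );
}

// Standard cosine similarity: dot product over the product of magnitudes.
function cosineSimilarity(a, b) {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB) || 1);
}
```

Two samples with proportionally identical function word usage score 1.0 regardless of length, which is what makes the measure robust to topic and sample size.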

Minimum Sample Size

Stylometric analysis becomes more reliable with longer samples. Below 100 words, results are essentially meaningless. At 300-500 words, basic patterns emerge. At 1,000+ words, vocabulary and function word distributions stabilize. For high-confidence attribution, 2,500+ words per sample is recommended. The analyzer displays a confidence indicator based on sample length.
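A confidence banding that mirrors the thresholds above might look like this. The function name and band labels are hypothetical; only the word-count cutoffs come from the text.

```javascript
// Hypothetical confidence banding by sample word count,
// using the thresholds described above.
function confidenceLevel(wordCount) {
  if (wordCount < 100) return "unreliable";   // essentially meaningless
  if (wordCount < 300) return "low";
  if (wordCount < 1000) return "basic";       // basic patterns emerge
  if (wordCount < 2500) return "moderate";    // distributions stabilizing
  return "high";                              // recommended for attribution
}
```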

Type-Token Ratio (TTR)
The ratio of unique words to total words. Higher TTR indicates a richer vocabulary. Corrected TTR normalizes for text length.
Hapax Legomena
Words that appear exactly once in a text. The ratio of hapax legomena to total words is a measure of vocabulary diversity.
Yule's K
A vocabulary richness metric that is relatively independent of text length. Lower values indicate more diverse vocabulary.
Flesch-Kincaid Grade
Estimates the U.S. grade level needed to understand the text, based on average sentence length and syllable count.
Gunning Fog Index
A readability formula that estimates years of education needed to understand a text on first reading.
Cosine Similarity
A measure of similarity between two vectors, used here to compare function word frequency distributions. 1.0 = identical distributions; 0.0 = no overlap in the tracked words.
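The three vocabulary richness metrics defined above can be computed from a single word-frequency table. This is a sketch using the standard formulas; the function name is illustrative.

```javascript
// Vocabulary richness metrics from a token list (sketch).
function vocabularyMetrics(words) {
  const counts = new Map();
  for (const w of words) counts.set(w, (counts.get(w) || 0) + 1);
  const N = words.length;

  // Type-token ratio: unique word types / total tokens.
  const ttr = counts.size / N;

  // Hapax legomena ratio: types occurring exactly once / total tokens.
  let hapax = 0;
  for (const c of counts.values()) if (c === 1) hapax++;
  const hapaxRatio = hapax / N;

  // Yule's K = 10^4 * (sum over types of count^2, minus N) / N^2.
  // Lower K = more diverse vocabulary; roughly length-independent.
  let sumSquares = 0;
  for (const c of counts.values()) sumSquares += c * c;
  const yulesK = 1e4 * (sumSquares - N) / (N * N);

  return { ttr, hapaxRatio, yulesK };
}
```

For example, the token list `["a", "b", "b", "c"]` has 3 types over 4 tokens (TTR 0.75), two hapaxes (ratio 0.5), and Yule's K of 1250.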

✍️ Stylometry — Frequently Asked Questions

How reliable is stylometric analysis?

Academic studies achieve 85-95% accuracy for authorship attribution when using sufficient sample sizes (2,500+ words) from a closed set of candidate authors. This tool provides an indicative similarity score, not a forensic certainty. It is most useful for generating investigative leads — flagging potential common authorship that can then be corroborated with other evidence. A high score suggests common authorship is plausible; a low score suggests different authors are likely.

Can an author deliberately disguise their style?

Partially. Conscious features like vocabulary and sentence length can be altered with effort. But function word usage, punctuation micro-patterns, and contraction habits are deeply ingrained and very difficult to consistently disguise across long texts. Studies show that deliberate obfuscation is detectable in itself — the resulting text often shows unnatural statistical properties. The analyzer examines multiple independent feature categories, making comprehensive disguise extremely difficult.

What kind of text works best?

Natural, unconstrained prose works best — blog posts, forum comments, personal emails, essays, articles. Avoid: quoted material from other sources, heavily edited or collaborative texts, poetry or fiction (which may involve deliberate style shifts), extremely short texts (under 200 words), and machine-generated or heavily template-driven content. Both samples should ideally be from similar genres, though the function word analysis is fairly genre-independent.

Does this tool store or transmit my text?

No. All analysis runs entirely in your browser using client-side JavaScript. Your text never leaves your device. No data is stored in cookies or localStorage. When you close the page, everything is gone. This makes it safe for analyzing sensitive or confidential texts.

How is the overall similarity score calculated?

The score is a weighted average of normalized similarities across feature categories: function word cosine similarity (30% weight — the strongest authorship signal), vocabulary metrics (20%), sentence structure (15%), punctuation patterns (15%), readability indices (10%), and n-gram overlap (10%). Each feature's contribution is displayed in the breakdown table so you can see which dimensions drive the score.
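The weighting scheme above reduces to a plain weighted average of per-category similarity scores in [0, 1]. A sketch, with the category keys as illustrative names:

```javascript
// Category weights as described above; they sum to 1.0.
const WEIGHTS = {
  functionWords: 0.30,  // strongest authorship signal
  vocabulary:    0.20,
  sentence:      0.15,
  punctuation:   0.15,
  readability:   0.10,
  ngramOverlap:  0.10,
};

// Weighted average of per-category similarities, each in [0, 1].
function overallSimilarity(categoryScores) {
  let total = 0;
  for (const [category, weight] of Object.entries(WEIGHTS)) {
    total += weight * categoryScores[category];
  }
  return total;
}
```

Because the weights sum to 1, the overall score stays in [0, 1] and each category's contribution in the breakdown table is simply weight × category score.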