Short Samples in Authorship Attribution

https://openmethods.dariah.eu/2017/09/20/microsoft-word-341-eder-short-samples-in-authorship-attribution-341-docx-341-pdf/ OpenMethods introduction to: Short Samples in Authorship Attribution 2017-09-20 17:54:53 Introduction: This article discusses the question of minimal sample size in stylometry setting it up as low as 2,000 words in some cases. Maciej Maryl Blog post Analysis Capture Data Data Recognition Distance Measures English Interpretation Language Literature Methods Research Activities Research Objects Research Results Research Techniques Stilistic Analysis Text via bookmarklet

Introduction by OpenMethods Editor (Maciej Maryl): This article discusses the question of minimal sample size in stylometry setting it up as low as 2,000 words in some cases.

The study was aimed at re-considering the minimum sample size for reliable authorship attribution. The results of the experiments suggest that a sufficient amount of textual data may be as little as 2,000 words in many cases. However, sometimes the authorial fingerprint is so vague, that one needs to use substantially longer samples to make the attribution feasible. A question of some importance is to which category an unknown (disputed) text belongs.

 

Source: Microsoft Word – 341. Eder-Short Samples in Authorship Attribution-341.docx – 341.pdf