Introduction: Apart from its buoyant conclusion that authorship attribution methods are rather robust to noise (transcription errors) introduced by optical character recognition and handwritten text recognition, this article also offers a comprehensive read on the application of sophisticated computational techniques for testing and validation in a data curation process.
Introduction: The rperseus package provides classicists and other people interested in ancient philology and exegesis with corpora of texts from the ancient world (based on the Perseus Digital Library), combined with a toolkit designed to compare passages and selected words with parallels where the same expressions or words occur.
Introduction: Know Your Implementation: Subgraphs in Literary Networks shows how the online tool ezlinavis can give account of detached subgraphs while working with network analysis of literary texts. For this specific case, Goethe’s Faust, Part One (1808) was analyzed and visualized with ezlinavis, and average distances were calculated giving some new results to this research in relation to Faust as protagonist.
Introduction: This post explains the necessary lemmatization process for topic modelling on French or European texts with Mallet.
Introduction: The post discusses the challenges that traditional philological approach has to face in creating digital corpora of critical editions of nonvernacular medieval works.
Introduction: This article reflects on the lessons learnt by the author as he first taught a graduate course in digital analysis of literary texts. He stresses the importance of methodologies over technologies, the need for well-curated, community-created teaching datasets and the implications of the practical, discipline-based organisation of the curricula.
Introduction: A review of the book BITECA: Bibliografia de textos antics catalans, valencians i balears: Biblioteques i Arxius Valencians, by Beltran, Avenoza & Soriano (2013), that is an excuse to explain the technologies used to work on the first Dictionary of the Old Spanish Langauge (DOSL) and other versions at the Hispanic Seminary of Medieval Studies (HSMS).
Introduction: This software paper in Polish describes “Magik” (Magician), a tool for textual scholars which allows for comparisons of different variants of the same text.
Introduction: This software paper describes ‘stylo’ – an R package for stylometric research and text processing.
Introduction: This article traces complex genealogy of distant reading to social-scientific approaches in literary studies.