Introduction: The indispensable Programming Historian comes with an introduction to Term Frequency – Inverse Document Frequency (tf-idf) provided by Matthew J. Lavin. The procedure, concerned with specificity of terms in a document, has its origins in information retrieval, but can be applied as an exploratory tool, finding textual similarity, or as a pre-processing tool for machine learning. It is therefore not only useful for textual scholars, but also for historians working with large collections of text.
Introduction: The rperseus package provides classicists and other people interested in ancient philology and exegesis with corpora of texts from the ancient world (based on the Perseus Digital Library), combined with a toolkit designed to compare passages and selected words with parallels where the same expressions or words occur.
Introduction: This article explains the concept, the uses and the procedural steps of text mining. It further provides information regarding available teaching courses and encourages readers to use the OpenMinTeD platform for the purpose.
Introduction: How do we improve the quality of the fledgling practice of Web archeology, so much needed now that a first decade of Web information threatens to disappear as current interest wanes but contemporaneous cultural value is undisputed. A National Library of the Netherlands scientific report investigates.
Introduction: NeMO is a conceptual framework for DH. It offers a well-founded conceptualization of scholarly work, which can function as schema for a knowledge base containing information on scholarly research activity, including goals, actors, methods, tools and resources involved.
Introduction: The author highlights the unexpected use of the on-line archives, by explaining his research and its cartographic representation.
Introduction: This post presents a technology which produces a dynamic image from photos, to improve the reading of illegible or erased signs.
Introduction: Here is the report of the 2017 symposium of two associations, Game4NLP and EACLinguistics, who shared ideas and methods for NLP research.