Introduction: Given in French by Mathieu Jacomy – also known for his work on Gephi, this seminar presentation gives a substantial introduction to Hyphe, an open-source web crawler designed by a team of the Sciences Po Medialab in Paris. Specifically devised for the researchers’ use, Hyphe helps collecting and curating a corpus of web pages, through an easy to handle interface.
Introduction: Studying n-grams of characters is today a classical choice in authorship attribution. If some discussion about the optimal length of these n-grams have been made, we have still have few clues about which specific type of n-grams are the most helpful in the process of efficiently identifying the author of a text. This paper partly fills that gap, by showing that most of the information gained from studying n-grams of characters comes from the affixes and punctuation.
Introduction: This blog post presents “TEI Simple”, a framework developed to ensure a simpler interaction between TEI and other formats, and to enable easier customization.
Introduction: This article describes the possibilities offered by the ggplot2 package for network visualization. This R package enables the user to use a wide variety of graphic styles, and to include supplementary information regarding vertices and edges.
Introduction: This post explains the necessary lemmatization process for topic modelling on French or European texts with Mallet.
Introduction: This post outlines the benefits of using a statistical software such as R for data analysis and visualization in DH, through the study of a correspondence network.