How Else to Create Lemmatized Text for Topic Modeling

OpenMethods introduction to: How Else to Create Lemmatized Text for Topic Modeling. Published 2017-08-30 at https://openmethods.dariah.eu/2017/08/30/how-else-to-create-lemmatized-text-for-topic-modeling-gods-graves-and-graphs/ (original blog post: http://senereko.hypotheses.org/11)

Introduction by OpenMethods Editor (Delphine Montoliu): This post reviews another post on annotations and text preparations for Topic Modeling.

Here it is at last, the first post in the new blog for the SeNeReKo project. On this blog, we will write about aspects of our work, ongoing experiments and lessons learned. Our work heavily builds on the work of others, and we hope it will be of use beyond our project as well. That is why I want to open this blog with a response to another blog post.

In his post ‘How to Create Lemmatized (French) Text for Topic Modeling’, Christof described how to make use of the TreeTagger output when preparing texts for Topic Modeling. As a by-product of our own work in SeNeReKo, we aimed for a more generic and hopefully simpler approach to deal with annotations of this kind.
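To make the general idea concrete, here is a minimal sketch of turning TreeTagger-style output into lemmatized text. It is not the SeNeReKo tooling and not Christof's script; it only assumes the usual tab-separated TreeTagger output of token, part-of-speech tag and lemma on each line. The function name `lemmatize_tagged`, the parameter `keep_pos_prefixes` and the sample sentence are made up for illustration, and the tag prefixes follow the French TreeTagger tagset.

```python
def lemmatize_tagged(tagged_text, keep_pos_prefixes=("NOM", "VER", "ADJ")):
    """Return a space-joined string of lemmas from TreeTagger-style output.

    Assumes one token per line in the form: token<TAB>POS<TAB>lemma.
    Only tokens whose POS tag starts with one of keep_pos_prefixes are kept.
    """
    lemmas = []
    for line in tagged_text.splitlines():
        parts = line.split("\t")
        if len(parts) != 3:
            continue  # skip blank lines, markup lines and malformed rows
        token, pos, lemma = parts
        if not pos.startswith(keep_pos_prefixes):
            continue  # drop function words, punctuation, etc.
        if lemma == "<unknown>":
            lemma = token  # fall back to the surface form for unknown lemmas
        lemmas.append(lemma.lower())
    return " ".join(lemmas)


# Hypothetical sample in TreeTagger's three-column format.
sample = (
    "Les\tDET:ART\tle\n"
    "chats\tNOM\tchat\n"
    "dorment\tVER:pres\tdormir\n"
    ".\tSENT\t."
)
print(lemmatize_tagged(sample))  # chat dormir
```

The resulting string can be written out one document per file (or per line) for whatever Topic Modeling tool is used. Filtering by tag prefix rather than exact tag keeps all verb forms such as `VER:pres` or `VER:infi` with a single entry.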

Original publication date: 12/07/2014.

Source: How Else to Create Lemmatized Text for Topic Modeling | Gods, Graves and Graphs