Introduction by OpenMethods Editor (Delphine Montoliu): This post reviews another post on annotations and text preparations for Topic Modeling.
Here it is at last, the first post in the new blog for the SeNeReKo project. On this blog, we will write about aspects of our work, ongoing experiments and lessons learned. Our work heavily builds on the work of others, and we hope it will be of use beyond our project as well. That is why I want to open this blog with a response to another blog post.
In his post ‘How to Create Lemmatized (French) Text for Topic Modeling’, Christof described how to make use of the TreeTagger output when preparing texts for Topic Modeling. As a by-product of our own work in SeNeReKo, we aimed for a more generic and hopefully simpler approach to deal with annotations of this kind.
Original publication date: 12/07/2014.