OpenMethods

HIGHLIGHTING DIGITAL HUMANITIES METHODS AND TOOLS

Tag: ISO standards

An end-to-end approach for extracting and segmenting high-variance references from PDF documents
  • Analysis

  • Posted on November 9, 2020; updated November 10, 2020
  • by Stefan Karcher

Introduction: Digital text analysis depends on one important thing: text that can be processed with little effort. Working with PDFs often leads to great difficulties, as Zeyd Boukhers, Shriharsh Ambhore, and Steffen Staab describe in their paper. Their goal is to extract references from PDF documents. A highlight of the workflow they describe is its very impressive precision rates. The paper thereby encourages further development of the process and its application as a “method” in the humanities.

In cooperation with

OPERAS

© Copyright 2017-2018 – OpenMethods
HaS has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 675570