GerIE - An Open Information Extraction System for the German Language
Akim Bassa (Unycom GmbH, Austria)
Mark Kroll (Know-Center, Austria)
Roman Kern (Know-Center, Austria)
Abstract: Open Information Extraction (OIE) makes it possible to extract relations from text without the need for domain-specific training data. To date, most research on OIE has focused on the English language, and little or no research has been conducted on other languages, including German. To tackle this problem, we developed GerIE, an OIE system for the German language. We surveyed the literature on OIE in order to identify concepts that may apply to the German language. Our system uses the output of a German dependency parser and a number of handcrafted rules to extract propositions. To evaluate the system, we created two dedicated datasets: one derived from news articles and the other from encyclopedia texts. Our system achieves F-measures of up to 0.89 for correctly preprocessed sentences.
Keywords: German language, fact extraction, open information extraction
Categories: I.2.7, I.7.m