Topic Map Generation Using Text Mining
Karsten Böhm (TextTech Ltd., Germany)
Gerhard Heyer (Leipzig University, Germany)
Uwe Quasthoff (Leipzig University, Germany)
Christian Wolff (Leipzig University, Germany)
Abstract: Starting from text corpus analysis with linguistic and statistical algorithms, an infrastructure for text mining is described which uses collocation analysis as its central tool. This text mining method can be applied across different domains as well as languages. Examples taken from large reference databases motivate its applicability to knowledge management based on declarative standards for structuring and describing information. The ISO/IEC Topic Map standard is introduced as a candidate for rich metadata description of information resources, and it is shown how text mining can be used for automatic topic map generation.
Keywords: corpora, knowledge management, semantic relations, text mining, topic maps
Categories: H.3.1, H.3.3, H.3.5, H.5.3, I.2.7, I.7