How to use GermaNet (German WordNet correspondent) with R

I want to use GermaNet to lemmatize (corresponding to getLemma() in WordNet) a list (actually DTM terms - - to improve text classification efficiency). But I could not find a hint or R-package for GermaNet. Somehow you can still use it in R?

+6
source share
1 answer

I assume that you have access to unprocessed files that store wordnet data (perhaps a free license is provided for the German suite). You can parse them (just using some nifty regular expressions) and extract the information you need (I don’t know exactly what the DTM is, but I suppose this is due to synchronizations or the links between them). The words (non-German) that I worked on were organized in several files, some of which contain links, some information in the form, for example

 0 @ 1@ WORD_MEANING 1 PART_OF_SPEECH "v" 1 VARIANTS 2 LITERAL "someverb" 3 SENSE 7 3 DEFINITION "adefinition" 3 EXAMPLES 4 EXAMPLE "anexample" 3 EXTERNAL_INFO ... 

It should not be too hard to make out.

+1
source

All Articles