How to use GermaNet (German WordNet correspondent) with R

Question

How to use GermaNet (German WordNet correspondent) with R

I want to use GermaNet to lemmatize (corresponding to getLemma() in WordNet) a list (actually DTM terms - - to improve text classification efficiency). But I could not find a hint or R-package for GermaNet. Somehow you can still use it in R?

+6

r lemmatization wordnet

alex Mar 19 '14 at 4:40

source share

1 answer

user3554004 · Answer 1 · 2015-07-09T16:02:33+0000

I assume that you have access to unprocessed files that store wordnet data (perhaps a free license is provided for the German suite). You can parse them (just using some nifty regular expressions) and extract the information you need (I don’t know exactly what the DTM is, but I suppose this is due to synchronizations or the links between them). The words (non-German) that I worked on were organized in several files, some of which contain links, some information in the form, for example

 0 @ 1@ WORD_MEANING 1 PART_OF_SPEECH "v" 1 VARIANTS 2 LITERAL "someverb" 3 SENSE 7 3 DEFINITION "adefinition" 3 EXAMPLES 4 EXAMPLE "anexample" 3 EXTERNAL_INFO ...

It should not be too hard to make out.

How to use GermaNet (German WordNet correspondent) with R

More articles: