Getting More Information (Active / Passive, Tenses ...) from Tagger

I use the Stanford Tagger to identify parts of speech, but I want to get more information from the text. Is it possible to get additional information, such as the tense of a sentence or whether it is in the active or passive voice?

So far I have been using the simplest PoS-Tagging approach:

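import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import edu.stanford.nlp.ling.HasWord;
import edu.stanford.nlp.ling.TaggedWord;
import edu.stanford.nlp.tagger.maxent.MaxentTagger;
// Assumes `tagger` is an already-initialized MaxentTagger.
List<List<TaggedWord>> taggedUnits = new ArrayList<List<TaggedWord>>();

String input = "This sentence is going to be future. The door was opened.";
for (List<HasWord> sentence : MaxentTagger.tokenizeText(new StringReader(input)))
{
     taggedUnits.add(tagger.tagSentence(sentence));
}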
1 answer

You can get tense information from the various Penn Treebank verb tags:

27. VB  Verb, base form
28. VBD Verb, past tense
29. VBG Verb, gerund or present participle
30. VBN Verb, past participle
31. VBP Verb, non-3rd person singular present
32. VBZ Verb, 3rd person singular present
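
As a rough illustration of reading these tags, here is a minimal sketch in plain Java. The class `VerbTagInfo` and its `describe` method are hypothetical helpers, not part of the Stanford API; in the question's code the tag string would come from `TaggedWord.tag()`. Note that a tag describes the form of a single verb, so a periphrastic future such as "is going to be" still has to be recognized from the combination of tags.

```java
import java.util.Map;

// Hypothetical helper (not part of the Stanford API): maps a Penn Treebank
// verb tag to the description given in the list above.
class VerbTagInfo {
    private static final Map<String, String> DESCRIPTIONS = Map.of(
        "VB", "base form",
        "VBD", "past tense",
        "VBG", "gerund or present participle",
        "VBN", "past participle",
        "VBP", "non-3rd person singular present",
        "VBZ", "3rd person singular present");

    // Returns the description for a verb tag, or a fallback for any other tag.
    static String describe(String tag) {
        return DESCRIPTIONS.getOrDefault(tag, "not a verb tag");
    }

    public static void main(String[] args) {
        // Example tags only; real tags would be read from each TaggedWord.
        for (String tag : new String[] {"VBD", "VBZ", "NN"}) {
            System.out.println(tag + " -> " + describe(tag));
        }
    }
}
```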

For the active/passive distinction, you can use the typed dependencies included in Stanford CoreNLP.

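  • If the sentence is in the active voice, there should be an nsubj (nominal subject) dependency.
  • If the sentence is in the passive voice, there should be an nsubjpass (passive nominal subject) dependency.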
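
The check described in the two bullets can be sketched as follows. This is a minimal, library-free example: the class `VoiceFromRelations` is hypothetical, and it assumes the relation names of one sentence have already been extracted from the parser output as plain strings. It also accepts `nsubj:pass`, on the assumption that newer Universal Dependencies output is in use, where the passive subject relation is spelled that way.

```java
import java.util.List;

// Hypothetical helper: guesses the voice of a sentence from the names of
// its typed dependency relations.
class VoiceFromRelations {
    enum Voice { ACTIVE, PASSIVE, UNKNOWN }

    static Voice classify(List<String> relations) {
        boolean hasActiveSubject = false;
        for (String rel : relations) {
            // A passive nominal subject marks the sentence as passive.
            if (rel.equals("nsubjpass") || rel.equals("nsubj:pass")) {
                return Voice.PASSIVE;
            }
            if (rel.equals("nsubj")) {
                hasActiveSubject = true;
            }
        }
        // No subject relation at all (e.g. a fragment): voice is undetermined.
        return hasActiveSubject ? Voice.ACTIVE : Voice.UNKNOWN;
    }

    public static void main(String[] args) {
        // Relation names one would expect for "The door was opened."
        System.out.println(classify(List.of("det", "nsubjpass", "auxpass")));
        // Relation names one would expect for "She opened the door."
        System.out.println(classify(List.of("nsubj", "det", "dobj")));
    }
}
```

A sentence with several clauses can contain both relations; this sketch reports passive as soon as any passive subject is found, which may or may not be the behavior you want.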

