What is the standard part of the nltk speech tag set?

While experimenting with some of the NLTK replication tags, I noticed a lot of VBP tags in the nltk.pos_tag my nltk.pos_tag calls. I noticed that this tag is not part of the Brown Case tags. This, however, is part of the UPenn tags.

What label set does nltk use by default? I can not find this in official documentation or apidocs.

+7
source share
2 answers

Ntlk uses the PennTreebank tag set. Take a look at this link http://nltk.org/api/nltk.tag.html

+8
source

POS tags used in the Penn Treebank project are used. You can see a list of tags that have the value " http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html "

+4
source

All Articles