I have suggestions for POS tags that can be obtained using the Stesford POS tester. For instance:
The / DT Island / NN was / VBD very / RB beautiful / JJ. /. I / PRP love / VBP it / PRP. /.
(xml format is also available)
Can anyone explain how to make a function selection from these POS tag sentences and convert them into a vector function for classifying text using machine learning.
An easy way to start would be something like the following (assuming the word order is not important for your classification algorithm).
. . , , . , , , , , . - /POS .
, . , - -. - , .. chi-squared, . , , 10% , .
, . , - , , . , .
, , -