I would like to create a custom NER model. What I've done:
DATA TRAINING (stanford-ner.tsv):
Hello O ! O My O name O is O Damiano PERSON . O
PROPERTIES (stanford-ner.prop):
trainFile = stanford-ner.tsv serializeTo = ner-model.ser.gz map = word=0,answer=1 maxLeft=1 useClassFeature=true useWord=true useNGrams=true noMidNGrams=true maxNGramLeng=6 usePrev=true useNext=true useDisjunctive=true useSequences=true usePrevSequences=true useTypeSeqs=true useTypeSeqs2=true useTypeySequences=true wordShape=chris2useLC useGazettes=true gazette=gazzetta.txt cleanGazette=true
GAZZETTE gazzetta.txt):
PERSON John PERSON Andrea
I will build the model through the command line using
java -classpath "stanford-ner.jar:lib
And check with:
java -classpath "stanford-ner.jar:lib
I did two tests with the following texts:
→> TEST 1 <
→> TEST 2 <
As you can see, the object "Damiano" was found. This object is in my training data, but John (second test) is inside the newspaper. So the question is.
Why is the entity John not recognized?
Thank you so much in advance.
source share