I still play with the Stanford CoreNLP, and I come across strange results on a very trivial Coreference resolution test.
Given two sentences:
The hotel had a large bathroom. It was very clean.
I would expect that the โThisโ in sentence 2 would be bound to the โbathโ code, or at least the โlarge bathโ of sentence 1.
Unfortunately, this indicates a "hotel", which, in my opinion, is erroneous.
Is there any way to solve this problem? Do I need to train something or should it work out of the box?
Annotation a = getPipeline().getAnnotation("The hotel had a big bathroom. It was very clean."); System.out.println(a.get(CorefChainAnnotation.class));
output:
{1 = CHAIN1 - ["Hotel" in sentence 1, "This" in sentence 2], 2 = CHAIN โโ2 - ["large bathroom" in sentence 1]}
Thank you very much for your help.
source share