I have some rough ideas about what needs handling: for example, singular vs. plural forms, two or more words or phrases that mean the same thing, typos, etc. But I don't know of any patterns or rules of thumb for solving these problems, whether programmatically and automatically, or by submitting them to administrators or even users for cleanup.
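To make the three cases concrete, here is a rough sketch of the kind of thing I imagine. Everything in it is an assumption for illustration: the `SYNONYMS` table, the naive plural rule, the `normalize` helper, and the 0.8 similarity cutoff are all made up, and only `difflib.get_close_matches` comes from Python's standard library.

```python
import difflib

# Hypothetical synonym table: alternative phrasings mapped to one canonical form.
SYNONYMS = {"js": "javascript", "ecmascript": "javascript"}


def singularize(term):
    """Very naive plural stripping; real English needs a proper inflection library."""
    if term.endswith("ies") and len(term) > 4:
        return term[:-3] + "y"
    if term.endswith("s") and not term.endswith("ss") and len(term) > 3:
        return term[:-1]
    return term


def normalize(term, known, cutoff=0.8):
    """Return (candidate, status), where status is 'known', 'suggested' or 'new'.

    'known'     -> cleaned automatically, no human needed
    'suggested' -> probably a typo of an existing term; ask an admin or user to confirm
    'new'       -> nothing similar exists; queue for review
    """
    cleaned = " ".join(term.lower().split())   # normalize case and whitespace
    cleaned = SYNONYMS.get(cleaned, cleaned)   # collapse synonyms
    cleaned = singularize(cleaned)             # collapse singular/plural
    if cleaned in known:
        return cleaned, "known"
    close = difflib.get_close_matches(cleaned, known, n=1, cutoff=cutoff)
    if close:
        return close[0], "suggested"
    return cleaned, "new"


known_terms = ["javascript", "library", "database"]
for raw in ["Libraries", "JS", "databse", "widgets"]:
    print(raw, "->", normalize(raw, known_terms))
```

The split between the automatic path (`known`) and the two review paths is the part I am least sure about, since a wrong automatic merge seems worse than an extra item in a review queue.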
Any thoughts or suggestions?