The distance levenshtein is the relative meaning between the two words. Comparing LD with length does not matter, for example
cat โ scat = 1 (similar to 75%)
difference โ differences = 1 (90% similar?)
Both of these words have left distances of 1, i.e. differ by one character, but compared to their length, the second set seems more "similar."
I use soundexing to rank words that have the same left distance, e.g.
cat and fat both have LD 1 relative to kat , but when using soundex, the word is more likely to be kat than fat, assuming the word is spelled incorrectly and not incorrectly typed!)
So the short answer is simply using the left distance to determine the similarity.
James westgate
source share