I'm filtering chat messages in a chat system where it's desirable to constrain strings to Latin-1 English. Users tend to get creative with their input, for example:
ßòógīē§
instead of
Boogies
Java has Unicode normalization methods that can strip diacritics, but I'm more interested in normalizing look-alike letter forms to English letters within the Latin-1 character set.
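For reference, the diacritic stripping I mean can be sketched roughly like this with the standard `java.text.Normalizer`. The class and method names (`DiacriticStripper`, `strip`) are made up for illustration; this is a minimal sketch, not a complete filter.

```java
import java.text.Normalizer;

// Hypothetical helper name; only Normalizer and String.replaceAll are real JDK APIs.
final class DiacriticStripper {
    // Decompose to NFD so an accented letter becomes base letter + combining mark
    // (for example "o" followed by U+0300), then delete all combining marks (\p{M}).
    static String strip(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }
}
```

This turns accented letters such as ò, ó, ī and ē into plain o, o, i and e, but symbols that merely look like letters (ß, §, ¥, ¤) have no canonical decomposition and pass through unchanged, which is exactly the gap I'm asking about.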
Are there any tables, libraries, or methods that can map common Unicode characters outside Latin-1 to their visually nearest forms? For instance:
ß -> B
§ -> S
¥ -> Y
¤ -> o
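To make the request concrete, a hand-maintained version of such a table might look like the sketch below. Everything here is an assumption for illustration: the names `LookalikeFolder` and `fold` are hypothetical, the table holds only the four example mappings, and it handles single `char` values only (no supplementary code points). `Map.of` requires Java 9 or later.

```java
import java.util.Map;

// Hypothetical helper; the table contains only the four example mappings.
final class LookalikeFolder {
    private static final Map<Character, Character> LOOKALIKES = Map.of(
        '\u00DF', 'B',  // sharp s
        '\u00A7', 'S',  // section sign
        '\u00A5', 'Y',  // yen sign
        '\u00A4', 'o'   // currency sign
    );

    // Replace each character that has a table entry; leave everything else as-is.
    static String fold(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            char mapped = LOOKALIKES.getOrDefault(c, c);
            out.append(mapped);
        }
        return out.toString();
    }
}
```

Maintaining this by hand for every creative substitution is what I'd like to avoid, hence the question about an existing table or library.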
I suspect the answer is "No, that would be too big, just filter them all out," but I can hope ...