I wonder whether there is something like an official conformance test suite from the Unicode Consortium (or a related organization) that lets you test and verify text-related algorithms and implementations.
Or is there anything similar?
The UCD (Unicode Character Database) contains several test data files. They provide data in standard formats that can be used to test implementations of Unicode algorithms.
You can get the latest Unicode Character Database here.
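To give an idea of how such a test data file is used, here is a minimal sketch in Python that checks a normalization implementation against lines in the format of the UCD's NormalizationTest.txt (five semicolon-separated columns of hex code points, with `#` comments and `@Part` headers). The helper names `parse_line` and `check_line` are made up for this example, and Python's built-in `unicodedata` module stands in for whatever implementation you actually want to test.

```python
import unicodedata


def parse_line(line):
    """Return the five columns c1..c5 as strings, or None for non-data lines.

    Hypothetical helper: assumes the NormalizationTest.txt layout of
    'c1;c2;c3;c4;c5; # comment', each column a space-separated list of
    hexadecimal code points.
    """
    data = line.split("#", 1)[0].strip()   # drop the trailing comment
    if not data or data.startswith("@"):   # blank line or '@PartN' header
        return None
    fields = data.split(";")[:5]
    return ["".join(chr(int(cp, 16)) for cp in f.split()) for f in fields]


def check_line(cols):
    """Check the invariants stated in the header of NormalizationTest.txt."""
    c1, c2, c3, c4, c5 = cols
    n = unicodedata.normalize
    return (
        all(n("NFC", c) == c2 for c in (c1, c2, c3))
        and all(n("NFC", c) == c4 for c in (c4, c5))
        and all(n("NFD", c) == c3 for c in (c1, c2, c3))
        and all(n("NFD", c) == c5 for c in (c4, c5))
        and all(n("NFKC", c) == c4 for c in cols)
        and all(n("NFKD", c) == c5 for c in cols)
    )


def failing_lines(lines):
    """Return the data lines whose invariants do not hold."""
    bad = []
    for line in lines:
        cols = parse_line(line)
        if cols is not None and not check_line(cols):
            bad.append(line)
    return bad
```

To run it against the real file, pass an open file object for a downloaded NormalizationTest.txt to `failing_lines`. Keep in mind that `unicodedata` is tied to one specific Unicode version (see `unicodedata.unidata_version`), so a test file from a newer UCD release may report failures for characters that Python build does not know about yet.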
Of course, there are many; just take them from ICU.
I just hope you are interested in C/C++ or Java :)