There are several issues here.
First, I must point out that there are many languages โโthat are not alphabets . Obviously, the Chinese or Japanese are examples of ideographic languages. Unfortunately, it will be very difficult, and nearby it is impossible to create a list of all the characters in these languages.
Secondly, although the Common Locale Data Repository and, as a result, ICUs have predefined sets of index examples and symbol examples, this information is far from complete.
-, , script ( ). , .
, , . ...