We are having problems with utf8 string comparisons in MySQL 5 regarding case and accents:
from what I have gathered, MySQL implements collations on the principle that "groups of characters should be considered equal."
For example, when sorting with utf8_unicode_ci, all the letters "EÉÈÊeéèê" fall into the same group (along with other variants of "e").
So, if you have a table containing ["video", "vidéo", "vidÉo", "vidÊo", "vidêo", "vidÈo", "vidèo", "vidEo"] (in a varchar column declared with utf8_general_ci):
- when MySQL is asked to sort the rows by this column, the resulting order is arbitrary (MySQL does not apply any ordering rule between "é" and "É", for example),
- when MySQL is asked to add a unique key on this column, it raises an error because it considers all the values equal.
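The two points above can be reproduced with something like the following. This is only a minimal sketch: the table name `words`, the column name `word`, and the key name `uq_word` are made up for illustration, and it assumes the client really sends UTF-8 over the connection.

```sql
-- Assumption: the client encoding is UTF-8, so declare it for the connection.
SET NAMES utf8;

-- Hypothetical table and column names, for illustration only.
CREATE TABLE words (
  word VARCHAR(32) CHARACTER SET utf8 COLLATE utf8_general_ci
);

INSERT INTO words (word) VALUES
  ('video'), ('vidéo'), ('vidÉo'), ('vidÊo'),
  ('vidêo'), ('vidÈo'), ('vidèo'), ('vidEo');

-- Point 1: the eight values compare as equal under this collation,
-- so their relative order in the result is not defined.
SELECT word FROM words ORDER BY word;

-- Point 2: expected to fail with a duplicate-entry error,
-- because the collation treats all eight values as the same key.
ALTER TABLE words ADD UNIQUE KEY uq_word (word);
```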
What settings can we change to fix these two points?
PS: in the corresponding documentation I do not see any case-sensitive collation for the utf8 character set. Did I miss something?
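For reference, one way to check which utf8 collations a given server offers is sketched below. It again assumes a UTF-8 connection. As far as I know, the list normally includes utf8_bin, which compares strings by their binary code values and therefore distinguishes both case and accents, although it does not give a linguistic sort order.

```sql
-- Assumption: the client encoding is UTF-8.
SET NAMES utf8;

-- List the collations available for the utf8 character set.
SHOW COLLATION WHERE Charset = 'utf8';

-- Compare the same two strings under two collations:
-- equal_ci should be 1 (case and accent ignored),
-- equal_bin should be 0 (binary comparison).
SELECT
  'vidéo' = 'vidÉo' COLLATE utf8_general_ci AS equal_ci,
  'vidéo' = 'vidÉo' COLLATE utf8_bin        AS equal_bin;
```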
[edit] I think that my initial question is still of some interest, and I will leave it as it is (and perhaps someday we will get a positive answer).
[...] character_set_client [...] MySQL, latin1. [...]