The C locale will be executed. UTF-8 is designed so that the byte order is also a code order. This is not trivial, but consider how UTF-8 works:
Number range Byte 1 Byte 2 Byte 3
0000-007F 0xxxxxxx
0080-07FF 110xxxxx 10xxxxxx
0800-FFFF 1110xxxx 10xxxxxx 10xxxxxx
When sorting aka C locale binary data, the first unequal byte will work with etermine. What we wanted to see, if the two numbers encoded in UTF-8 are different, then the first unequal byte will be lower for a lower value. If the numbers are in different ranges, then the first byte will indeed be lower for the lower number. Within the same range, the order is determined literally by the same bits as without encoding.
chx
source share