index.html is most likely not encoded as UTF-8; it is probably ISO-8859-1 or Windows-1252. A hex editor is useful in these cases to see how characters such as ö are actually stored.
If index.html is in UTF-8, ö is stored as two bytes, c3 b6. If it is in ISO-8859-1, it is a single byte, f6.
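If no hex editor is at hand, a short Python sketch can show the bytes each encoding produces for ö. The helper name `encoded_hex` is only illustrative and not part of the original question.

```python
def encoded_hex(text: str, encoding: str) -> str:
    """Return the bytes of `text` in `encoding` as space-separated hex."""
    return " ".join(f"{byte:02x}" for byte in text.encode(encoding))


O_UMLAUT = "\u00f6"  # the character ö

print(encoded_hex(O_UMLAUT, "utf-8"))       # c3 b6
print(encoded_hex(O_UMLAUT, "iso-8859-1"))  # f6
print(encoded_hex(O_UMLAUT, "cp1252"))      # f6
```

Comparing these values with what the hex editor shows at the position of ö in index.html indicates which encoding the file uses.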
To solve this problem, either transcode the file to UTF-8 or select the matching codec when reading it.
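A minimal sketch of the transcoding option, assuming the file really is ISO-8859-1 (that default is an assumption; pass "cp1252" if the file came from Windows). The function name `transcode_to_utf8` is hypothetical. It leaves the file alone when the bytes are already valid UTF-8, so running it twice does not double-encode.

```python
from pathlib import Path


def transcode_to_utf8(path, source_encoding="iso-8859-1"):
    """Rewrite `path` as UTF-8. Return True if the file was changed.

    `source_encoding` is a guess about the current encoding of the file;
    verify it with a hex editor first.
    """
    file_path = Path(path)
    raw = file_path.read_bytes()

    # If the bytes already decode as UTF-8, do nothing.
    try:
        raw.decode("utf-8")
        return False
    except UnicodeDecodeError:
        pass

    # Decode with the assumed legacy encoding and write back as UTF-8.
    text = raw.decode(source_encoding)
    file_path.write_bytes(text.encode("utf-8"))
    return True
```

For example, `transcode_to_utf8("index.html")` would turn a stored f6 into c3 b6. For the other option, reading without converting, the codec can be given directly: `open("index.html", encoding="iso-8859-1")`.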
Anders Lindahl