According to this , SQL Server 2K5 uses UCS-2 internally. It can store UTF-16 data in UCS-2 (with corresponding data types, nchar, etc.), however, if there is an extra character, it is saved as 2 UCS-2 characters.
This leads to obvious problems with string functions, namely that what is a single character is treated by SQL Server as 2.
I am somewhat surprised that SQL Server is basically only capable of handling UCS-2, and moreover, this is not fixed in SQL 2K8. I really appreciate that some of these characters may not be so common.
In addition to the features suggested in this article, any suggestions on the best approach to working with (damaged) UTF-16 string functions and data in SQL Server 2K5.
David Cameron
source
share