There are slight differences in UCS2 and UCS4 (e.g. Windows and Linux), Python builds due to errors, conflicting or outdated standards, etc.
unicodetest.py :
#-*- coding: utf-8 -*- print 'Result:', u'π£' == u'\U00010123' print 'Len:', len(u'π£'), len(u'\U00010123') print 'Repr:', repr(u'π£'), repr(u'\U00010123')
Output (Python 2.6, Linux):
Result: False Len: 2 1 Repr: u'\ud800\udd23' u'\U00010123'
source share