When I do this:
text = u"奥巴马讲话"
for c in text:
print c
I got the expected result:
奥
巴
马
讲
话
But if I do this:
text = u"𤭢€"
for c in text:
print c
I got:
€
I expect to receive:
𤭢
€
Why is this? I think this has something to do with the following fact:
In [1]: u"𤭢".encode("utf8")
Out[1]: '\xf0\xa4\xad\xa2'
"𤭢" is encoded using 4 bytes.
How can I scroll a unicode string that has this kind of encoding?
Something like u "𤭢 𤭢 𤭢 𤭢 𤭢 𤭢".
source
share