So I have a string:
amélie
In bytes it is b'ame\xcc\x81lie'
In UTF-8, the bytes \xcc\x81 encode U+0301 (COMBINING ACUTE ACCENT), which attaches an acute accent to the preceding character: http://www.fileformat.info/info/unicode/char/0301/index.htm
u'ame\u0301lie'
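To double-check the decomposition, here is a small sketch that decodes the bytes and lists each code point. It assumes Python 3 and uses only the standard `unicodedata` module; the variable names `raw` and `text` are just for illustration.

```python
import unicodedata

# The UTF-8 bytes from above: 'e' is followed by the two-byte sequence \xcc\x81.
raw = b"ame\xcc\x81lie"
text = raw.decode("utf-8")

# Print code point, general category, and Unicode name for each character.
for ch in text:
    print(f"U+{ord(ch):04X}", unicodedata.category(ch), unicodedata.name(ch))
```

The fourth entry is U+0301 with category Mn (nonspacing mark), so the string has 7 code points even though it displays as 6 characters.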
When I call 'amélie'.title() on this string, I get "AméLie", which makes no sense to me.
I know I can work around it, but is this intentional behavior or a bug? I would expect the l NOT to be capitalized.
Another experiment:
In [1]: [ord(c) for c in 'amélie'.title()]
Out[1]: [65, 109, 101, 769, 76, 105, 101]
In [2]: [ord(c) for c in 'amélie']
Out[2]: [97, 109, 101, 769, 108, 105, 101]
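For reference, the workaround I have in mind is roughly the sketch below. It assumes Python 3 and that composing the string to NFC before title-casing is acceptable; `title_nfc` is a name I made up for illustration, not a standard function.

```python
import unicodedata

def title_nfc(text):
    """Hypothetical helper: compose to NFC first, then title-case."""
    # NFC merges 'e' + U+0301 into the single precomposed character U+00E9,
    # so no combining mark is left in the middle of the word.
    return unicodedata.normalize("NFC", text).title()

decomposed = "ame\u0301lie"
composed_title = title_nfc(decomposed)
print([ord(c) for c in composed_title])
# prints [65, 109, 233, 108, 105, 101]
```

This only helps when a precomposed form exists for the base letter and accent; for combinations without one, the combining mark would remain in the string after normalization.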