I noticed that without declaring the encoding of the source code, the Python 2 interpreter assumes that the source code is encoded in ASCII with scripts and standard input:
$ python test.py # where test.py holds the line: print u'é' File "test.py", line 1 SyntaxError: Non-ASCII character '\xc3' in file test.py on line 1, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details $ echo "print u'é'" | python File "/dev/fd/63", line 1 SyntaxError: Non-ASCII character '\xc3' in file /dev/fd/63 on line 1, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details
and encoded in ISO-8859-1 with the -m and -c module command flags:
$ python -m test # where test.py holds the line: print u'é' é $ python -c "print u'é'" é
Where is this documented?
Contrast this with Python 3, which always assumes that the source code is UTF-8 encoded and thus prints é on four occasions.
Note. - I tested this on CPython 2.7.14 on both macOS 10.13 and Ubuntu Linux 17.10 with console encoding installed in UTF-8.
source share