6, character encoding

Original link: http://www.cnblogs.com/hlc-123/p/10915232.html

The first coding table support Chinese GB2312 (1980 years) to support 7445 characters, including 6,763 Chinese and 632 and other characters;
GBK1.0 (1995 years) Character area contains 21,003 characters;
GB18030 (2000 years) contains 27,484 Chinese characters and other minority languages;
Unicode: Unicode encoding support all countries and regions of the
2 ** 16 = 65535 keep a unified character occupies two bytes
UITF-8: Unicode is a superset of variable-length character encoding set
English one byte characters use two bytes of Europe to save the character of East Asia three bytes
Assic -> GB2313 -> GBK1.0 -> GB18030
Assic -> Unicode -> UTF-8

python2.X default Assic coding
python3.X default unicode will be used

windowds changed GBK: chcp 936

Reproduced in: https: //www.cnblogs.com/hlc-123/p/10915232.html

Guess you like

Origin blog.csdn.net/weixin_30648587/article/details/95283022