python 报错"UnicodeDecodeError: 'utf-8' codec can't decode byte"的解决办法 - 代码天地

python 报错"UnicodeDecodeError: 'utf-8' codec can't decode byte"的解决办法

其他 2019-05-10 23:02:50 阅读次数: 0

最近写了一个Python小程序，用来统计《三国演义》中人物出场次数的。从网上下载一个”三国演义.txt”的文件，但是后来程序运行时出现以下报错：
UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xa1 in position 0: invalid start byte
后来经过不断查找终于找到了解决办法。

由于我在程序中设定文件打开的编码格式为“utf-8”,但是我后来用电脑的记事本打开这个”三国演义.txt”文件，然后在点击另存为的时候，发现原文件的编码方式是“ANSI”. 哦哦哦哦哦哦哦哦哦哦哦。。。。不报错才怪呢！

解决办法很简单，只需要在另存为的时候，选择编码方式为：UTF-8即可，就像下面这样

运行代码

import jieba

txt=open("threeKindoms.txt",'r',encoding='utf-8').read()
words=jieba.lcut(txt)
counts={}

for word in words:
    if len(word)==1:
        continue
    else:
        counts[word]=counts.get(word,0)+1
items=list(counts.items())
items.sort(key=lambda x:x[1],reverse=True)
for i in range(15):
    word,count=items[i]
    print("{0:<10}{1:>5}".format(word,count))

运行结果：

猜你喜欢

转载自blog.csdn.net/weixin_42686879/article/details/89495413

python 报错"UnicodeDecodeError: 'utf-8' codec can't decode byte"的解决办法

(mac) python中UnicodeDecodeError: 'utf-8' codec can't decode byte 报错

python3 error : 解决UnicodeDecodeError 'utf-8' codec can't decode byte..问题

UnicodeDecodeError: ‘utf-8’ codec can’t decode byte...

python问题：UnicodeDecodeError: 'utf-8' codec can't decode byte in position : invalid start byte

python报错：UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc3 in position 0

python 发送邮件报错UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xc4 in position 0

python3 windows utf-8运行报错UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xd3 in position 13: in

Python_报错SyntaxError: (unicode error) ‘utf-8‘ codec can‘t decode byte ...

Python3解决UnicodeDecodeError: 'utf-8' codec can't decode byte..问题终极解决方案

Python3解决UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte..问题终极解决方案

pycharm debug出现UnicodeDecodeError: 'utf-8' codec can't decode 解决办法

python 网络爬虫报错“UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position”解决方案

python3 网络爬虫报错“UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position”解决方案

python利用pandas读取csv报错：UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc8...解决方法

Python3错误：UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd5 解决方法

Python 编码问题：UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa8 in position

Python报错：UnicodeDecodeError: 'gbk' codec can't decode byte ...

python问题--UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte

Python读取文件UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbc in position 2: invalid start byte

python UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbd in position 0: invalid start byte

python系列之:UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xff in position 64:invalid start byte

解决Django:UnicodeDecodeError: 'utf-8' codec can't decode byte 0xcb in position 325

(2020.1.2已解决)pyinstaller || UnicodeDecodeError:'utf-8' codec can't decode byte Oxce in position 118

python3 报错：UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd6 in position 201: invalid continuation byte

【python】UnicodeDecodeError: 'utf-8' codec can't decode byte 0xce in position 130: invalid continuat

python3 UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc9 in position 167

Python：出现UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc9 in position 0: invalid co

python error：UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa1 in position 0

mysql-connector-python取二进制字节时报错UnicodeDecodeError:'utf-8' codec can't decode byte 0xb0 in position 0

今日推荐

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

“百模大战”必有一战 | 2024中国“百模大战”竞争格局分析

最强开源大模型 Llama 3 上架 Gitee AI

周排行

自媒体文章如何提高原创度以及如何检测原创度

开启qq邮箱的smtp服务

Qt程序单次启动（QSingleApplication类）

国外的外包网站

更新IDEA主题——放飞代码风格

cocos2dx 实现搓牌效果（翻牌效果），包括铺平动画

dict和json之间的互相转换

angular的一些思考

. Fibonacci数列是这样定义的： F[0] = 0 F[1] = 1 for each i ≥ 2: F[i] = F[i-1] + F[i-2] 因此，Fibonacci数列就形如：0, 1

洛谷P1064 金明的预算方案

每日归档

更多

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(5)

2024-04-16(70)