python 识别图片中的汉字 - 代码天地

python 识别图片中的汉字

其他 2019-06-02 07:44:46 阅读次数: 0

我们就识别上面的汉字。

安装软件tesseract和python库

https://www.cnblogs.com/sea-stream/p/10961580.html

然后新建一个文件夹test,把上面那张图片放在文件夹里面，再新建一个test文件

写入如下内容

#coding=utf-8
from PIL import Image
import pytesseract
#上面都是导包，只需要下面这一行就能实现图片文字识别
text=pytesseract.image_to_string(Image.open('xxx.png'),lang='chi_sim')
print(text)

目录如下：

运行可能会出现错误：

C:\Users\k\Desktop\test>python test.py
Traceback (most recent call last):
  File "test.py", line 5, in <module>
    text=pytesseract.image_to_string(Image.open('xxx.png'),lang='chi_sim')
  File "C:\Users\k\Anaconda3\lib\site-packages\pytesseract\pytesseract.py", line 309, in image_to_string
    }[output_type]()
  File "C:\Users\k\Anaconda3\lib\site-packages\pytesseract\pytesseract.py", line 308, in <lambda>
    Output.STRING: lambda: run_and_get_output(*args),
  File "C:\Users\k\Anaconda3\lib\site-packages\pytesseract\pytesseract.py", line 218, in run_and_get_output
    run_tesseract(**kwargs)
  File "C:\Users\k\Anaconda3\lib\site-packages\pytesseract\pytesseract.py", line 194, in run_tesseract
    raise TesseractError(status_code, get_errors(error_string))
pytesseract.pytesseract.TesseractError: (1, 'Error opening data file C:\\Program Files (x86)\\Tesseract-OCR/tessdata/chi_sim.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory. Failed loading language \'chi_sim\' Tesseract couldn\'t load any languages! Could not initialize tesseract.')

因为tesseract-ocr默认不支持中文识别。将下载到的文件：chi_sim.traineddata 放到Tesseract-OCR安装目录 D:\Program Files (x86)\Tesseract-OCR\tessdata 下

链接：https://pan.baidu.com/s/1c-fveIYnm1sQHxX9WRpUZw
提取码：9ovq

再次运行

python test.py

下面是输出结果

C:\Users\k\Desktop\test>python test.py
风急天高猿啸衷′ 渚麦冒麦少丑弓飞口。
u边洛木萧萧下′ 不〖长江滚滚来。
万 悲禾火常作畜′ 年多病独登台。
艰难苦恨萦霜 渣倒新停澍酉木不=

参考：

https://www.cnblogs.com/lizhe860/p/8969171.html

https://blog.csdn.net/showgea/article/details/82656515

猜你喜欢

转载自www.cnblogs.com/sea-stream/p/10961744.html

python 识别图片中的汉字

python识别图片中的代码。

Python识别图片中的文字

python切图并识别图片中的文字

python使用pytesseract识别图片中的文字

python 识别图片中的文字信息

python--识别图片中的文字

Python 识别图片中表格

【python人脸识别】使用opencv识别图片中的人脸

python实战===用python识别图片中的中文

python-opencv-人脸识别实现从图片中扣人脸

python 包的使用（二）——tesseract识别图片中的文字

python识别图片中的文字处理方法

使用Python进行OCR识别图片中的文字

python利用pytesser3识别图片中的文字信息

Python 识别图片中的文字—OCR实战教程

python识别图片中的文字、数值并转文档

通过Python的pytesseract库识别图片中的文字

Python Opencv实践 - 入门使用Tesseract识别图片中的文字

python生成词云时，图片中的汉字出现口字型错误

【Python • 图片识别】pytesseract快速识别提取图片中的文字

Python 利用百度文字识别 API 识别并提取图片中文字

利用python识别图片中的条码（pyzbar）及条码图片矫正和增强

批量识别图片中文字（python、百度开发者工具）

Python通过百度Ai识别图片中的文字

Python实现识别图片中的所有人脸并显示出来

利用百度文字识别图片中的文字(python版)

详解利用python+opencv识别图片中的圆形（霍夫变换）

【课外拓展】：识别图片中的二维码（python+opencv+pyzbar）

超简单使用Python识别图片中的中/英文字/包含工具下载链接

今日推荐

Linus “吃狗粮”最积极！

开源日报 | Winamp播放器即将开源；生成式AI之战升级第二轮；Linus“吃狗粮”最积极；AI进入泡沫前期；吴泳铭为阿里云带来了什么？

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

周排行

LogN级别的区间查询算法(线段树), 你学会了吗

数论概论(英文版.第4版)

idea 更新后和新的直接安装前，都需要配置 idea64.exe.vmoptions 后再使用

CANOpen系列教程04_CAN总线波特率、位时序、帧类型及格式说明

Java序列化基础

java排序算法整理

异常：org.apache.ibatis.reflection.ReflectionException

（算法练习）——二路归并排序

go 闭包函数

好程序员web前端技术分享媒体查询

每日归档

更多

2024-05-21(8)

2024-05-20(36)

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)