【文本处理词频统计】python 实现词频统计 - 代码天地

【文本处理词频统计】python 实现词频统计

其他 2018-08-16 10:31:00 阅读次数: 0

自定义词频统计函数：wordcount

# -*- encoding=utf-8 -*-

import string
import pandas as pd

word_list=[]
freq_list=[]
def wordcount(path):
    with open(path,'r',encoding='utf-8') as text:
        words = [raw_word.strip(string.punctuation).lower() for raw_word in text.read().split()]
        words_index = set(words)
        words_count = {index:words.count(index) for index in words_index}
    for  word in sorted(words_count ,key=lambda x:words_count[x],reverse=True):
        print('{} {}'.format(word,words_count[word]))
        word_list.append(word)
        freq_list.append(words_count[word])



if __name__ == '__main__':

    path = 'F:\\标签库\\data\\aa.csv'
    result=pd.DataFrame({"word":word_list,"freq":freq_list})
    result.to_csv('F:\\标签库\\data\\bb.csv',index=False)

E:\laidefa\python.exe "E:/Program Files/pycharmproject/文本关键词提取/词频统计.py"
vs 2960
情况 1560
联赛 1473
亚盘 1337
分析 1239
优势 1014
主胜 925
后市 890
支持 846

猜你喜欢

转载自blog.csdn.net/u013421629/article/details/81028154

【文本处理词频统计】python 实现词频统计

Python文本词频统计

【Python】文本词频统计

Python之文本词频统计

Python-文本词频统计

Python实例--文本词频统计

文本处理、词频统计与Simhash生成文档指纹

python实现词频统计

Python实现文本词频统计算法及完整代码

文本词频统计

python day 17 文本词频统计

二级python——文本词频统计

Python实例分析——文本词频统计

统计文本词频的几种方法（Python）

Python自然语言处理—统计词频

python实现词频统计并展示

Python英文文本词频统计——读取英文文本进行词频统计并输出

python3.6 抓取网页文本并实现词频统计-自然语言处理小项目

实例10：文本词频统计

jieba和文本词频统计

词频统计（Java实现）

Java实现词频统计

Elasticsearch词频统计实现

python词频统计

Python 词频统计

Python 简易词频统计

python统计词频

统计词频 -- Python

词频统计（python）

Python之词频统计

今日推荐

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

周排行

浏览器对同一域名进行请求的最大并发连接数

React Hook之自定义Hook

【转】MyBatis缓存机制

-Java-泛型

自动化测试常用脚本-发送邮件

LeetCode#859: Buddy Strings

java、Python处理字符串

第二篇の博客

Hadoop伪分布式环境安装

SQL Server进阶（十一）临时表、表变量

每日归档

更多

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)