Python基础知识—文件批量处理

其他 2021-12-15 08:16:43 阅读次数: 0

Python基础知识—文件批量处理

找到所有文件
- os.listdir()
- os.path.join()
找到文件特定字段
- re.findall()
- os.path.join()
替换
- os.path.join()
- re.sub()
- string.startwith()

Q: 找到所有文件中的特定字段，然后替换掉这个特定字段

1）初步思考

步骤：
- 遍历所有文本文件
- 找到文件中特定字段
- 替换掉这个特定字段

2）找到所有文件

import os
print(os.listdir("../test"))

['new_file.txt']

3）找到文件特定字段

for filename in os.listdir("../test"):
    file_path = os.path.join("test", filename)
    with open(file_path, "r") as f:
        print(file_path, ":", f.read())
        
new_file.txt : some text...
add new line
百度 https://baidu.com, 这个 www.baidu.com 可以访问到百度

import re

string = "百度 https://baidu.com, 这个 www.baidu.com 可以访问到百度"
res = re.findall(r"(http://)?(baidu.com)", string)
for r in res:
    print(r[1])
    
baidu.com
baidu.com

4）替换

有俩个方案：

在原文本上替换，并覆盖原文本的内容
复制出一个新的文件，将原文本替换过的文字拷贝到新文件中，原文件不改变

for filename in os.listdir("../test")
	  file_path = os.path.join("test", filename)
    with open(file_path, "r") as f1:
        string = f1.read()
        new_string = re.sub(r"baidu.com", "google.com", string)
        with open(os.path.join("test", "new_"+filename), "w") as f2:
            f2.write(new_string)

for filename in os.listdir("../test"):
		if filename.startswith("new_"):
				continue
    file_path = os.path.join("test", "new_"+filename)
    with open(file_path, "r") as f:
        print(file_path, ":", f.read())
        
some text...
add new line
百度 https://google.com, 这个 www.google.com 可以访问到百度

参考：[莫烦Python](

猜你喜欢

转载自blog.csdn.net/Mrwei_418/article/details/121117234

Python基础知识—文件批量处理

python基础知识点，文件处理合集

python基础知识6---文件处理

python基础知识-文件

Linux基础知识之文件处理

Python爬虫基础知识：异常的处理

Python基础知识第四篇：方法重写+文件处理+异常处理，冒死上传

Python基础知识5（文件的读写）

Python基础知识之文件（四）

Python基础知识—文件管理

python基础知识5---数据类型、字符编码、文件处理

python入门《基础知识7--目录和文件高级操处理，shutil模块》

文件基础知识

Python基础（二十五）：异常处理基础知识

Python地理数据处理二：Python基础知识

《Python编程》015 – Python异常处理基础知识

python 基础知识

python基础知识

python的基础知识

基础知识python

python 的基础知识

Python 基础知识！

Python - 基础知识

Python——基础知识

python | 基础知识

python:基础知识

【python基础知识】16.文件读写基础及操作

python基础知识整理——错误以及异常处理

python爬虫基础知识整理——urlerror异常处理

python基础知识三——try与except处理异常语句

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)