Python 爬虫 urllib.request 对比 requests - 代码天地

Python 爬虫 urllib.request 对比 requests

物联网 2023-08-06 14:02:33 阅读次数: 0

get请求对比: get请求相对urllib.rqquest 则没有需要特别的转码就可以得到响应的数据，非茶馆的方便， requests中只需要 .text 属性就能获取到源码，而urllib.request.urlopen()之后还得.read().decode('utf-8')去解码才能获取到解码后的源码，很不友好。

# requests   代码
import requests

url = "https://www.baidu.com/s?"

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/103.0.0.0 Safari/537.36'
}

datas = {
    'wd':'北京'
}
response = requests.get(url=url,params=datas,headers=headers)

content = response.text
with open("北京.html","w",encoding='utf-8') as op:
    op.write(content)
print(content)

import urllib.request
import urllib.parse

base_url = "https://www.baidu.com/s?"

data = {
    'wd':'周杰伦',
    'sex':'男',
    'location':'中国台湾省'
}
new_date = urllib.parse.urlencode(data)
base_url+=new_date

headers={'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/103.0.0.0 Safari/537.36'}
request = urllib.request.Request(url=base_url,headers=headers);
response = urllib.request.urlopen(request)
content = response.read().decode('utf-8')
print(content)

post 请求对比：

# 对比urllib.request
# 优点:  1.post请求不需要编解码    2.post请求参数是data   3.不需要请求对象的定制

import requests
import json

url = "https://fanyi.baidu.com/sug"

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/103.0.0.0 Safari/537.36'
}

datas = {
    'kw':'ever'
}

# 对比urllib.request
# 优点:  1.post请求不需要编解码    2.post请求参数是data   3.不需要请求对象的定制
response = requests.post(url=url,headers=headers,data=datas)

# content = response.text
# print(content)
content = response.text
print(content)
print("------")
obj = json.loads(content)
print(obj)

猜你喜欢

转载自blog.csdn.net/weixin_46310452/article/details/126004939

Python 爬虫 urllib.request 对比 requests

Python爬虫实践 —— urllib.request和requests

Python 爬虫：urllib.request

python爬虫的urllib与requests的对比

Python爬虫（urllib.request和BeautifulSoup）

Python3 内置http.client,urllib.request及三方库requests发送请求对比

Python3 内置 http.client,urllib.request及三方库 requests 发送请求对比

爬虫urllib.request

Python-爬虫03：urllib.request模块的使用

python爬虫基础知识（一）--Urllib.request

爬虫 urllib.request 模块

1.0 -Python爬虫-Urllib/Requests

Python 3 urllib.request

【Python】python3网络爬虫-urllib.request发送请求

python3网络爬虫一《使用urllib.request发送请求》

Python爬虫之爬取内涵吧段子（urllib.request）

【Python爬虫】使用urllib.request下载已知链接的网络资源

用Python第一个爬虫程序（urllib.request)

python爬虫实践2：用urllib.request爬取天气网的图片

爬虫学习-urllib.request信息发送

爬虫基础 || 1.2 urllib.request

【Python爬虫】requests与urllib库的区别

Python爬虫库urllib，requests基本方法

Python中的urllib.request模块

Python——urllib.request模块的使用

python中urllib.request对象案例

urllib与requests的对比

python 爬虫访问网页之request与requests：

python库的解析--urllib.request 用于打开 URL 的可扩展库(urllib.request库)

【爬虫】使用urllib.request去爬取小说

今日推荐

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

《2024 年一季度互联网投融资运行情况》研究报告

报告：Django 仍然是 74% 开发者的首选

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

周排行

记一下去大梅沙的准备（2018-05-26）

Spring 注解事务

基于HTTP协议的客户端缓存

阿里云rds 备份和还原

[PHP] 几个拖慢 PHP 程序/API 运行速度的点

python 代码风格------------PEP8规则

js控制json生成菜单——自制菜单（一）

将字符串: 'k:1|k1:2|k2:3|k3:4 ' ,处理成 python 字典: {'k':1, 'k1':2, ...}

微信小程序转支付宝小程序

Qt551.窗口滚动条

每日归档

更多

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)