Web Scraping with the urllib and requests Libraries - 代码天地


Other · 2020-08-04 10:42:15

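First, we use the urllib library to fetch the content of a web page.
This requires the lxml and urllib libraries: urllib ships with the Python standard library, while lxml is a third-party package that must be installed separately.
One of my own blog posts serves as the example page to scrape.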

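import time
import urllib.request

import lxml.html

url = 'https://blog.csdn.net/Xiang_lhh/article/details/104940609'

# Send the request; urlopen returns a file-like response object
resp = urllib.request.urlopen(url)
# lxml.html.parse accepts a file-like object and returns an element tree
html = lxml.html.parse(resp)
# Select the text nodes of every <p> element; the XPath expression locates the elements
ps = html.xpath('//p/text()')
for p in ps:
    print(p)
time.sleep(5)  # pause before issuing any further requests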

This prints the text content of the p tags.
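Some servers may reject or stall requests that carry urllib's default headers, so it can help to send an explicit User-Agent and a timeout. The following is a minimal sketch under that assumption, not part of the original post: the names `DEFAULT_HEADERS`, `build_request`, and `fetch_bytes`, as well as the User-Agent string itself, are hypothetical and chosen only for illustration.

```python
import urllib.request

# Hypothetical header value; any browser-like User-Agent string would do.
DEFAULT_HEADERS = {"User-Agent": "Mozilla/5.0 (compatible; example-crawler)"}


def build_request(url, headers=None):
    """Return a urllib Request object that carries explicit headers."""
    return urllib.request.Request(url, headers=headers or DEFAULT_HEADERS)


def fetch_bytes(url, timeout=10):
    """Fetch url and return the raw response body, or None on failure."""
    try:
        with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
            return resp.read()
    except OSError as exc:
        # URLError, HTTPError and socket timeouts are all OSError subclasses.
        print(f"request failed: {exc}")
        return None


# Example call (performs a real network request):
# body = fetch_bytes('https://blog.csdn.net/Xiang_lhh/article/details/104940609')
```

The returned bytes can then be handed to lxml for parsing, for instance with `lxml.html.fromstring(body)`, before applying the same XPath expression as above.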
The requests library

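import time

import requests
from lxml import etree

test_url = 'https://blog.csdn.net/Xiang_lhh/article/details/104940609'

# Send a GET request and take the decoded response body as text
resp = requests.get(url=test_url).text
# Parse the HTML string into an element tree
html = etree.HTML(resp)
# Select the text nodes of every <p> element
ps = html.xpath('//p/text()')
for p in ps:
    print(p)
time.sleep(5)  # pause before issuing any further requests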

This prints the same content, this time fetched with the requests library.
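One detail of the XPath expression used above is that `//p/text()` returns only the text nodes that are direct children of each p element, so text inside nested tags such as `<a>` or `<b>` is not included. The sketch below shows one possible alternative that uses `string(.)` to collect all descendant text, and adds a timeout and a status check to the request. The function names `extract_paragraphs` and `fetch_paragraphs` and the User-Agent value are hypothetical, introduced only for this illustration.

```python
import requests
from lxml import etree


def extract_paragraphs(html_text):
    """Return the stripped, non-empty text of each <p> element."""
    tree = etree.HTML(html_text)
    paragraphs = []
    for p in tree.xpath('//p'):
        # string(.) joins all descendant text, including text inside
        # nested tags, which //p/text() would not return.
        text = p.xpath('string(.)').strip()
        if text:
            paragraphs.append(text)
    return paragraphs


def fetch_paragraphs(url, timeout=10):
    """Download url with requests and extract its paragraph text."""
    # Hypothetical User-Agent value, chosen only for illustration.
    headers = {"User-Agent": "Mozilla/5.0 (compatible; example-crawler)"}
    resp = requests.get(url, headers=headers, timeout=timeout)
    resp.raise_for_status()  # raises requests.HTTPError on 4xx/5xx responses
    return extract_paragraphs(resp.text)
```

Keeping the parsing step in its own function means it can be tried on a small inline HTML string without any network access, while `fetch_paragraphs(test_url)` would apply it to the live page.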


Reposted from blog.csdn.net/Xiang_lhh/article/details/105332865
