python3爬虫-urllib+BeautifulSoup - 代码天地

python3爬虫-urllib+BeautifulSoup

其他 2018-10-25 16:06:03 阅读次数: 0

版权声明：如有侵权，请联系，如有错误，望指正，欢迎转载 https://blog.csdn.net/qq_29630271/article/details/79265797

urllib

在Python2版本中，有urllib和urlib2两个库可以用来实现request的发送。而在Python3中，已经不存在urllib2这个库了，统一为urllib。Python3 urllib库包括了四个模块。
urllib.request for opening and reading URLs
urllib.error containing the exceptions raised by urllib.request
urllib.parse for parsing URLs
urllib.robotparser for parsing robots.txt files

import urllib.request
from bs4 import BeautifulSoup

response = urllib.request.urlopen("http://www.biqukan.com/1_1094/")
html = response.read().decode("gbk")
div_bf = BeautifulSoup(html)
div = div_bf.find_all('div', class_ = 'listmain')
a_bf = BeautifulSoup(str(div[0]))
a = a_bf.find_all('a')
for each in a:
    print(each.string, each.get('href'))

猜你喜欢

转载自blog.csdn.net/qq_29630271/article/details/79265797

python3爬虫-urllib+BeautifulSoup

python3: 爬虫---- urllib, beautifulsoup

Python3爬虫--两种方法（requests(urllib)和BeautifulSoup）爬取网站pdf

python3 爬虫（requests+BeautifulSoup）

python3 爬虫（一）--初识urllib

Python3爬虫实战（urllib模块）

python3 urllib爬虫抓取记录

Python3爬虫urllib使用介绍

Python3爬虫笔记 -- urllib

Python3爬虫urllib库的使用

Python爬虫（urllib.request和BeautifulSoup）

Python3爬虫(3)_urllib.error

python3 urllib

python3 爬虫相关-requests和BeautifulSoup

python3实现网络爬虫（2）--BeautifulSoup使用（1）

Python3中beautifulsoup库的使用(爬虫利器)

【python3爬虫】beautifulsoup4 安装

python3爬虫学习之beautifulsoup实战

python3爬虫学习之数据提取之beautifulsoup

python3 --- 基于requests + beautifulsoup 实现爬虫项目

Python3爬虫（一）：请求库之urllib

python3爬虫入门（urllib和requests简单使用）

python3爬虫学习之urllib库

python3使用urllib模块制作网络爬虫

python3爬虫(1)--urllib请求库使用

【Python3 爬虫】U02_urllib库

python3爬虫之Urllib库（一）

python3爬虫之Urllib库（二）

python3 BeautifulSoup模块

Python3 的urllib实例

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)