Learning Web Scraping with Python (1): Environment Setup with Python + requests + BeautifulSoup

Category: Other · 2020-01-26 11:41:02

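Author: IT 小样
A crawler (spider) is a program that fetches the information you need from web pages. There are many ways to implement one; this series focuses on writing the code with Python 3 + requests + BeautifulSoup.
This first post briefly introduces the crawling workflow and the environment setup.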

Crawler workflow

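Send a request -> receive the response data -> parse the data and extract what you need
Sending the request and receiving the response can be handled by the requests library, and parsing the data can be handled by the BeautifulSoup library. Both are simpler and more convenient to use than the other approaches.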
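The three steps above can be sketched roughly as follows. This is a minimal illustration, not the series' actual code: the function names `fetch_html` and `extract_title`, the sample HTML string, and the placeholder URL in the comment are all assumptions made for the example.

```python
from typing import Optional

import requests
from bs4 import BeautifulSoup


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Steps 1 and 2: send the request and return the response body as text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx status codes
    return response.text


def extract_title(html: str) -> Optional[str]:
    """Step 3: parse the HTML and pull out one piece of data (the page title)."""
    soup = BeautifulSoup(html, "html.parser")
    title_tag = soup.find("title")
    return title_tag.get_text(strip=True) if title_tag else None


# Hypothetical sample page, used so the parsing step runs without network access.
sample_html = "<html><head><title>Demo page</title></head><body></body></html>"
print(extract_title(sample_html))

# With network access, the same parser can be fed a live page
# (the URL is a placeholder, not one taken from the article):
# print(extract_title(fetch_html("https://example.com")))
```

Keeping the fetch and parse steps in separate functions makes it possible to try the parsing logic on a saved HTML string before pointing it at a real site.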

Installing Python

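First download the Python installer. Installing Python 3 directly is recommended, and the latest version is fine. During installation, make sure to tick the option that adds Python to the system PATH, so that the python command can be run from any directory on the command line without switching paths first.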
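One way to confirm the installation is to run a short script with the new interpreter. The sketch below is only an illustration; the helper name `is_python3` is made up for this example. It prints which interpreter is running and whether a `python` executable can be found on the PATH.

```python
import shutil
import sys


def is_python3(version_info=sys.version_info) -> bool:
    """Return True when the given version tuple belongs to Python 3 or later."""
    return version_info[0] >= 3


print("Interpreter:", sys.executable)
print("Version:", sys.version.split()[0])
print("Is Python 3:", is_python3())
# shutil.which returns None when no 'python' executable is on the PATH.
print("'python' on PATH:", shutil.which("python"))
```

If the last line prints None, the PATH option was probably not selected during installation.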

Installing the requests library

It can be installed directly with pip: pip install requests

Installing BeautifulSoup

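Install it with pip: pip install bs4 (on PyPI, bs4 is a thin wrapper that pulls in the beautifulsoup4 package; pip install beautifulsoup4 installs it directly).
To use BeautifulSoup in code, import it with from bs4 import BeautifulSoup
You should also install the lxml parser with pip: pip install lxml
lxml is recommended because it parses faster than html.parser, the parser that ships with Python.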
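Since lxml is an optional extra, a script can try it first and fall back to the built-in parser when it is missing. The following is a sketch under that assumption; `make_soup` is a hypothetical helper name, and the HTML string is invented sample data.

```python
from bs4 import BeautifulSoup, FeatureNotFound


def make_soup(html: str) -> BeautifulSoup:
    """Parse with lxml when it is installed, otherwise use the built-in parser."""
    try:
        return BeautifulSoup(html, "lxml")
    except FeatureNotFound:
        # bs4 raises FeatureNotFound when the requested parser is not available.
        return BeautifulSoup(html, "html.parser")


soup = make_soup("<ul><li>requests</li><li>bs4</li><li>lxml</li></ul>")
items = [li.get_text() for li in soup.find_all("li")]
print(items)
```

Naming the parser explicitly also keeps results consistent across machines, because BeautifulSoup otherwise picks whichever parser it finds installed.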

At this point, the environment is essentially set up.
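To double-check the setup, a small script can report whether each of the three packages is importable and which version is installed. This is a sketch only; `PACKAGES`, `installed_version`, and `check_environment` are names chosen for this example. It relies on the standard-library `importlib` module, so it runs even when a package is missing.

```python
from importlib import metadata, util

# Import name -> distribution name on PyPI (bs4 is provided by beautifulsoup4).
PACKAGES = {"requests": "requests", "bs4": "beautifulsoup4", "lxml": "lxml"}


def installed_version(import_name, dist_name):
    """Return the version string, or None if the module cannot be imported."""
    if util.find_spec(import_name) is None:
        return None
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        # Importable, but no distribution metadata was found under that name.
        return "unknown"


def check_environment(packages=PACKAGES):
    """Map each import name to its installed version (None when missing)."""
    return {name: installed_version(name, dist) for name, dist in packages.items()}


for name, version in check_environment().items():
    print(f"{name}: {version or 'NOT INSTALLED'}")
```

Any line that prints NOT INSTALLED points to the pip command that still needs to be run.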

Next: Learning Web Scraping with Python (2): The requests Library

Reposted from blog.csdn.net/weixin_31315135/article/details/88685424
