python bs4模块 BeautifulSoup 学习笔记 - 代码天地

python bs4模块 BeautifulSoup 学习笔记

其他 2018-10-20 19:11:54 阅读次数: 0

版权声明：本文为博主原创文章，未经博主允许不得转载。 https://blog.csdn.net/wuchenlhy/article/details/81164643

bs4 模块的 BeautifulSoup 可以用来爬取html页面的内容，配合requests库可以写简单的爬虫。

1、利用requests请求html页面，获取HTML页面内容

import requests
from bs4 import BeautifulSoup


session = requests.session()

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.99 Safari/537.36'
}

session.headers.update(headers)

# step 1  打开登陆页面
url = 'http://10.10.10.10/xx'
r = session.get(url)
html = r.text

2、利用BeautifulSoup，解析HTML得到想要的信息

soup = BeautifulSoup(html, 'html.parser')
# BeautifulSoup支持多种元素定位方式，也支持CSS定位,得到的是一个列表，列表中的元素信息可以用get方法获取
s1 = soup.select('#id')[0].get('value')
#S1 就是对应元素value属性的值
print(s1)

猜你喜欢

转载自blog.csdn.net/wuchenlhy/article/details/81164643

python bs4模块 BeautifulSoup 学习笔记

Python爬虫学习笔记（六）————BeautifulSoup（bs4）解析

python bs4 BeautifulSoup

python爬虫二:bs4库中的BeautifulSoup模块

python bs4(beautifulsoup4)

python爬虫学习笔记3：bs4及BeautifulSoup库学习

python bs4 BeautifulSoup用法

python 在linux上面安装beautifulsoup4(bs4) No module named 'bs4'

bs4——BeautifulSoup模块：解析网页

python 爬虫之beautifulsoup（bs4）使用

find_all的用法 Python（bs4，BeautifulSoup）

python 爬虫：BeautifulSoup(bs4) 找不到对应的元素

python库的解析--BeautifulSoup(bs4库)

python报错cannot import name ‘BeautifulSoup‘ from ‘bs4‘

python爬虫思路及BeautifulSoup bs4使用

Python bs4 BeautifulSoup库使用记录

Python学习笔记：BeautifulSoup模块

python学习笔记(bs4)

python(BS4模块)

bs4中的BeautifulSoup

Bs4 BeautifulSoup取值

windows下安装 bs4(BeautifulSoup4.3.2)模块

bs4模块 03 解析库beautifulsoup

【Python网络爬虫】150讲轻松搞定Python网络爬虫付费课程笔记篇八——爬虫解析库 bs4 BeautifulSoup

vscode Python 无法导入beautifulsoup4解决方案（bs4报错：vscode unresolved import 'beautifulsoup4'）

Python 驭虫术 bs4（BeautifulSoup4）库

Python中的BS4模块

python爬虫-bs4模块

python bs4模块快速入门

python—bs4模块解析

今日推荐

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

周排行

阿里云短信服务平台注册

Windows下的字符串处理(1)

sqoop: mysql导入数据到hdfs, hive, hbase

commons.lang中常用的工具类

离线安装PostgreSQL11.6

使用PyTorch简单实现卷积神经网络模型

一文彻底搞定谱聚类

一道面试题引发的血案

One Chat for Mac(聊天工具)

TCP/IP的底层队列是如何实现的？

每日归档

更多

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)