# Python data scraping, stored into a MySQL database

import pymysql
import requests
from bs4 import BeautifulSoup
import lxml
# Module-level accumulator: get_path() fills it with article href paths,
# data_store() then iterates it to fetch each detail page.
message_list = []

def get_content():
    """Fetch the recruitment listing page and return its HTML as text.

    Returns:
        str: UTF-8 decoded HTML of http://www.scetc.cn/reList.
    """
    url = "http://www.scetc.cn/reList"
    headers = {"User-Agent": "Mozilla/5.0(compatible;MSIE 9.0;Windows NT 6.1;Trident / 5.0)"}
    # A request without a timeout can hang the script forever if the server
    # stalls; 10 s is a safe upper bound for a single page fetch.
    response = requests.get(url, headers=headers, timeout=10)
    response.encoding = 'utf-8'
    return response.text

def get_path():
    """Collect the href of every article link on the listing page.

    Side effect: appends each href to the module-level ``message_list``.
    """
    html = get_content()
    soup = BeautifulSoup(html, 'lxml')
    # 'links' instead of 'list' — the original shadowed the builtin.
    links = soup.select('div[class="newsbox"] ul li a')
    message_list.extend(a['href'] for a in links)

def add(name, site, time, place, major, remark):
    """Insert one employment record into the ``employment`` table.

    Args:
        name, site, time, place, major, remark: column values for the row.
    """
    # NOTE(review): credentials are hard-coded; move them to config/env vars.
    con = pymysql.connect(host='localhost', user='root', password='123456', database='test')
    try:
        with con.cursor() as cursor:
            sql = ("insert into employment(name,site,time,place,major,remark)"
                   "values (%s,%s,%s,%s,%s,%s)")
            cursor.execute(sql, [name, site, time, place, major, remark])
        # BUG FIX: pymysql does not autocommit by default — without commit()
        # the INSERT was silently rolled back when the connection closed.
        con.commit()
    finally:
        # Ensure the connection is released even if execute() raises.
        con.close()
    print("数据存储成功!")

def data_store():
    """Fetch every scraped article page, extract its table cells and print them."""
    get_path()
    # Hoisted out of the loop: the same headers are reused for every request.
    headers = {"User-Agent": "Mozilla/5.0(compatible;MSIE 9.0;Windows NT 6.1;Trident / 5.0)"}
    for path in message_list:
        url = "http://www.scetc.cn/" + path
        # timeout prevents a single stalled page from hanging the whole run
        response = requests.get(url, headers=headers, timeout=10)
        response.encoding = 'utf-8'
        soup = BeautifulSoup(response.text, 'lxml')
        # 'cells' instead of 'list' — the original shadowed the builtin.
        cells = soup.select('div[class="flat-wrapper"] table tr td')
        employment = [td.string for td in cells]
        print(employment)
        #add(employment[1], employment[3], employment[5], employment[7], employment[9], employment[11])

# Entry point: scrape the listing page, then fetch and print each article's table.
if __name__=='__main__':
    data_store()

# Reprinted from blog.csdn.net/weixin_57803787/article/details/124873903