python实现电影天堂种子磁力的爬取 - 代码天地

python实现电影天堂种子磁力的爬取

其他 2018-08-27 13:11:28 阅读次数: 0

import requests,re



def getdetail(url):

    response = requests.get(url)
    #dytt的编码为gbk非utf-8
    
    html = response.content.decode('gbk')
    # 电影详情页标题
    movie_title_name = re.search('<h1><font color=#07519a>(.*)</f',html)
    
    movie_title = movie_title_name.group(1)
    # 电影 磁力   magnet
    movie_magnet_url = re.search('/><a href="(.*)"><str',html)
    
    # print(movie_magnet.group(1))
    movie_magnet = movie_magnet_url.group(1)
    # torrent种子
    movie_torrent_url = re.search('ddf"><a href="(.*)">ft',html)
    
    movie_torrent = movie_torrent_url.group(1)
    # print(movie_torrent.group(1))
    # 这个列表用来title
    movie_title_list = []
    movie_title_list.append(movie_title)

    # 这个列表两个下载的链接
    movie_down_url = []
    
    
    movie_down_url.append(movie_magnet)
    
    movie_down_url.append(movie_torrent)
    
    movie_down_url_all = []
    
    movie_down_url_all.append(movie_down_url)

    #保持标题，磁力，种子的同步准确性
    movie_dict = dict(zip(movie_title_list,movie_down_url_all))
    print(movie_dict)



def getpage():
    num = int(input('你要爬取多少页电影呢'))
    #获取每一页的url
    for i in range(1,num):
        lurl = 'http://www.dytt8.net/html/gndy/dyzz/list_23_%s.html' % i

        response = requests.get(lurl)

        html = response.text
        #取出电影详情页的url
        movie_url_list = re.findall('<a href="(.*)" class="ulink"',html)

        for movie_item in movie_url_list:
            movie_url = 'http://www.dytt8.net'+movie_item
            getdetail(movie_url)


if __name__ == '__main__':
    getpage()

猜你喜欢

转载自blog.csdn.net/majiexiong/article/details/81838677

python实现电影天堂种子磁力的爬取

爬取电影天堂电影磁力

python爬虫——爬取电影天堂磁力链接

Python爬取电影天堂

Python实现爬取电影天堂最新电影资源

python 爬取电影天堂电影续编

python 爬取电影天堂电影

爬取电影天堂

[python爬虫]爬取电影天堂连接

python利用requests模块，实现爬取电影天堂最新电影信息。

python3 爬取电影天堂最新电影

电影天堂数据爬取

爬取电影天堂资源并实现下载视频资源

电影天堂电影链接爬取

爬虫爬取电影天堂电影链接

Python笔记6——爬取电影天堂链接

零基础爬取电影天堂

电影天堂爬取详情页

XPath之电影天堂数据爬取

爬虫之爬取电影天堂（request）

Scrapy爬虫爬取电影天堂

xpath；；利用xpath爬取电影天堂

Python爬取电影天堂指定电视剧或者电影

[python爬虫之路day5]：实战之电影天堂2019精选电影爬取

教你如何用python来爬取电影天堂上面的电影

爬取电影天堂最新电影的名称和下载链接

BeautifulSoup爬取电影天堂全站电影资源

爬取电影天堂电影列表和详情页

爬取《电影天堂》，保存评分大于7.0 的电影地址

【PY】没有电影看？来教你用Python爬取电影天堂最新电影！

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)