python爬虫猫眼电影排行top100实例 - 代码天地

python爬虫猫眼电影排行top100实例

编程语言 2019-03-16 13:20:52 阅读次数: 0

今天是个好天气，培训了一个月了，可以看的懂python代码，一直对爬虫比较感兴趣，今天星期六没上课就看视频，跟着老师敲代码，中间各种错误，到饭点了才弄好，成功爬取！这个时刻也是值得纪念的，心情和天气一样晴朗。感兴趣的朋友也可以照下面的代码自己敲一遍，运行一下看看效果。
import requests,re
from requests.exceptions import RequestException
import json
def get_one_page(url):
try:
response = requests.get(url)
if response.status_code == 200 :
return response.text
return None
except RequestException :
return None
def parse_one_page(html):
pattern = re.compile(’

. ?board-index.?>(\d+)</i. ?data-src="(.?)". ?name"><a’
+’.?>(. ?).?star">(. ?)

.?releasetime">(. ?)’
+’.?integer">(. ?).?fraction">(. ?).?’,re.S)
items = re.findall(pattern,html)
for item in items:
yield {
‘index’:item[0],
‘image’: item[1],
‘title’: item[2],
‘actor’: item[3].strip()[3:],
‘time’: item[4].strip()[5:],
‘score’: item[5]+item[6],
}
def write_to_file(content):
with open(‘result.txt’,‘a’,encoding=‘utf-8’) as f :
f.write(json.dumps(content,ensure_ascii=False)+’\n’)
f.close()
def main(offset):
url = “ https://maoyan.com/board/4?offset=”+str(offset)
html = get_one_page(url)
parse_one_page(html)
for item in parse_one_page(html):
print(item)
write_to_file(item)
if name == ‘ main’:
for i in range(10):
main(i*10)

告诉你使我达到目标的奥秘吧，我唯一的力量就是我的坚持精神。

猜你喜欢

转载自blog.csdn.net/weixin_44651916/article/details/88594966

python爬虫猫眼电影排行top100实例

Python爬取猫眼电影排行TOP100的电影

抓取猫眼电影排行top100

python网络爬虫--正则表达式抓取猫眼电影排行TOP100

Python爬虫学习案例之抓取猫眼电影排行Top100

Python爬虫之一：抓取猫眼电影TOP100

python爬虫爬取猫眼电影Top100

python爬虫，爬取猫眼电影top100

python爬虫入门 ✦ 爬取猫眼电影Top100

python爬虫入门 ✦ 爬取猫眼电影Top100

python爬虫--猫眼电影TOP100榜爬取

网络爬虫-猫眼电影top100

爬虫_抓取猫眼电影TOP100

python爬虫入门新手向实战 - 爬取猫眼电影Top100排行榜

使用正则表达式爬虫抓取猫眼电影排行Top100

python爬虫开发之使用Python爬虫库requests多线程抓取猫眼电影TOP100实例

python简单爬虫实例4之猫眼网top100抓取特定内容（100个电影）

正则匹配的抓取猫眼电影排行Top100

00_抓取猫眼电影排行TOP100

猫眼电影top100

python-猫眼爬虫Top100

python猫眼top100实例

python爬取猫眼电影top100排行榜

python爬虫：爬取猫眼TOP100榜的100部高分经典电影

python：猫眼电影TOP100的电影爬取

【网络爬虫实战】猫眼电影Top100

爬虫练习 | 爬取猫眼电影Top100

猫眼电影top100票房爬虫 Request + 正则

爬虫六之爬取猫眼电影top100

python爬虫实战：利用beautiful soup爬取猫眼电影TOP100榜单内容-1

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)