爬虫获取知乎登陆的网页信息

import requests
import re
url='https://www.zhihu.com/node/Register?params=%7B%22is_org_page%22%3Afalse%7D'
header={
    
    'user-agent':
      'Mozilla/5.0 (Windows NT 10.0; Win64; x64)' 
      'AppleWebKit/537.36 (KHTML, like Gecko)' 
      'Chrome/85.0.4183.83 Safari/537.36'}
a=requests.get(url,headers=header)
b=re.findall('[\u4e00-\u9fa5]{2,10}',a.text)
for i in b:
    print(i)

运行后图片

猜你喜欢

转载自blog.csdn.net/liaoqingjian/article/details/108430050