Python爬虫框架开发环境简介和Scrapy安装

一、开发环境
Centos6.0
Scrapy1.5
Python3.6
Mongodb3.6
Pycharm

二、Scrapy1.5安装
pip install scrapy

    如果Python3.6时通过编译安装的,先进行如下设置,设置环境变量,设置别名。
    ![](https://s1.51cto.com/images/blog/201902/08/46f23f6aed868c0c87133a58a6868430.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    ![](https://s1.51cto.com/images/blog/201902/08/8029366f42a0b3d1f47f0724678652dd.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    安装遇到的问题:
    ![](https://s1.51cto.com/images/blog/201902/08/d74444564d7db6d8511e955e4addcb02.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    pip无法请求HTTPS类型的网站,原因是缺少SSL模块。解决办法安装openssl-devel。命令是 sudo yum install -y openssl-devel
    ![](https://s1.51cto.com/images/blog/201902/08/ae3f3139a496603e91f3193547b9785a.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    SSL安装成功
    ![](https://s1.51cto.com/images/blog/201902/08/b57250a0e9b494da2a4473ebf98a6483.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    现在执行pip3 install scrapy 还是报错,这是因为我们的Python3.6是通过编译安装的,所以要重新编译Python以兼容SSL模块。
    ![](https://s1.51cto.com/images/blog/201902/08/868ef7652adcbe322d8cda2427cf6c2e.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    ![](https://s1.51cto.com/images/blog/201902/08/9c4a90e3ee750a765c8919193ccc3dc6.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    编译安装执行make && make install完毕后,再执行pip3 install scrapy又报了一个错误,找不到twisted大于13.1.0的版本。
    ![](https://s1.51cto.com/images/blog/201902/08/a97b01ea06c499b699f65bc52a18d52e.png?x-oss-process=image/watermark,size_16,text_QDUxQ1RP5Y2a5a6i,color_FFFFFF,t_100,g_se,x_10,y_10,shadow_90,type_ZmFuZ3poZW5naGVpdGk=)
    打开浏览器,输入https://pypi.org/ 搜索twisted并下载。完成后通过rz命令导入到服务器中,移动到soft文件中。使用tar -xvjf 进行解压缩,进入到解压后的twisted文件中,使用Python3 setup.py install安装。
    再次pip3 install scrapy,成功。安装位置在python安装文件的bin目录下,编辑.bashrc,设置别名。使用source .bashrc编译。在命令行输入scrapy即可。

猜你喜欢

转载自blog.51cto.com/13870710/2348957
今日推荐