Python Hacking Tools - Web Scraper

Preparation:

Python libraries used in the following program:

1. Requests Documentation: https://2.python-requests.org//en/master/

2. Beautiful Soup Documentation: https://www.crummy.com/software/BeautifulSoup/bs4/doc/

Install the libraries on Kali Linux:

apt-get install python-requests

apt-get install python-bs4

Proxy Domain:

https://free-proxy-list.net/

Python Scraper Code:

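import requests
from bs4 import BeautifulSoup

proxyDomain = "https://free-proxy-list.net/"

# Fetch the page and keep the response so its body can be parsed
r = requests.get(proxyDomain)

soup = BeautifulSoup(r.content, 'html.parser')

# Locate the proxy table by its id
table = soup.find('table', {"id" : "proxylisttable"})

for row in table.find_all('tr'):
    columns = row.find_all('td')
    try:
        # Print IP:port followed by the next two columns of the row
        print("%s:%s\t%-40s\t%-10s" % (columns[0].get_text(), columns[1].get_text(), columns[2].get_text(), columns[3].get_text()))
    except IndexError:
        # Rows without enough <td> cells (such as the header row) are skipped
        pass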
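
The table lookup depends on the page's markup, which can change at any time. If no table with that id exists, soup.find returns None and the loop fails with an AttributeError. The following is a minimal sketch of the same parsing step wrapped in a function that handles that case. The function name parse_proxies, the table_id parameter, and the SAMPLE_HTML string (including its column headers and addresses) are hypothetical and stand in for the downloaded page, so the example runs without network access.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the downloaded page.
# The addresses come from ranges reserved for documentation.
SAMPLE_HTML = """
<table id="proxylisttable">
  <tr><th>IP Address</th><th>Port</th><th>Code</th><th>Country</th></tr>
  <tr><td>192.0.2.10</td><td>8080</td><td>US</td><td>United States</td></tr>
  <tr><td>198.51.100.7</td><td>3128</td><td>DE</td><td>Germany</td></tr>
</table>
"""


def parse_proxies(html, table_id="proxylisttable"):
    """Return a list of "ip:port" strings from the table with the given id."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", {"id": table_id})
    if table is None:
        # The page layout changed or the id is wrong: nothing to parse
        return []

    proxies = []
    for row in table.find_all("tr"):
        columns = row.find_all("td")
        if len(columns) < 2:
            # Header rows use <th> cells, so they have no <td> columns
            continue
        ip = columns[0].get_text(strip=True)
        port = columns[1].get_text(strip=True)
        proxies.append("%s:%s" % (ip, port))
    return proxies


if __name__ == "__main__":
    for proxy in parse_proxies(SAMPLE_HTML):
        print(proxy)
```

To try it against the live page, pass the body of the requests.get response to parse_proxies instead of SAMPLE_HTML. An empty result then suggests that the table id no longer matches the page.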

Reposted from www.cnblogs.com/keepmoving1113/p/11312429.html