Hmm... I once wrote a crawler for this with Scrapy, and it did pull everything down, but it was slow and a pain to configure. Today I found a good tutorial on NetEase Cloud Classroom, adapted its sample code, and it not only ran successfully but was noticeably faster. Noting it down here for future reference.
因?yàn)檎搲拗扑阉鲿r(shí)間在一年以內(nèi),為了方便本地查找資源,因此目標(biāo)為保存所有的專輯及鏈接到一個(gè)HTML中。
The idea is simple: open the page and identify the data to grab (links and titles); copy the browser's Cookie and User-Agent into a dict; download the page with the requests library using those headers; parse it with BeautifulSoup; tweak the CSS selector until it points exactly at the needed elements; then put each album title and its link into a dict and process the output.
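The selector-tweaking step is easiest to do offline, against a small saved fragment of the page, before running the full crawl. A minimal sketch (the HTML below is a made-up stand-in for the forum's actual markup, which I obviously can't reproduce here):

```python
from bs4 import BeautifulSoup

# Made-up fragment mimicking the forum's thread-list structure
html = '''
<table>
  <tr class="t_one"><td><h3><a href="read.php?tid=1">Album One</a></h3></td></tr>
  <tr class="t_one"><td><h3><a href="read.php?tid=2">Album Two</a></h3></td></tr>
</table>
'''

soup = BeautifulSoup(html, 'html.parser')  # html.parser avoids the lxml dependency for quick tests
anchors = soup.select('tr.t_one > td > h3 > a')
for a in anchors:
    print(a.get_text(), a.get('href'))
```

Once the selector prints exactly the rows you expect and nothing else, it can be dropped into the real script.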
from bs4 import BeautifulSoup
import requests
import time
# Define the pages to crawl and the request headers
# (fill in start/end/step for the page range, plus your own Cookie and User-Agent)
urls = ['https://hostname/bbs/thread.php?fid=fid&page={}'.format(i) for i in range(start, end, step)]
headers = {
    'User-Agent': '',
    'Cookie': ''
}
# Scrape album titles and links from one listing page
def getAlbumInfo(url):
    wb_data = requests.get(url, headers=headers)
    time.sleep(4)  # be polite: pause between requests
    soup = BeautifulSoup(wb_data.text, 'lxml')
    # One selector is enough: each anchor carries both the title and the link
    anchors = soup.select('tr.t_one > td > h3 > a')
    data = []
    for anchor in anchors:
        data.append({
            'title': anchor.get_text(),
            'link': anchor.get('href'),
        })
    print(data)
    return data
# Render the results as HTML links and append them to the output file
def dataProcessAndWriteToFile(result):
    with open('output.html', 'a', encoding='utf-8') as f:
        for item in result:
            link = '<a href="https://hostname/bbs/{}">{}</a><br>\n'.format(item['link'], item['title'])
            print(link)
            f.write(link)
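One caveat when generating the HTML this way: titles scraped from the page can contain characters like `&` or `<`, which would break the output markup. The stdlib's `html.escape` handles this; a small sketch (`make_link` is a hypothetical helper, not part of the original script):

```python
from html import escape

def make_link(base, item):
    # Hypothetical helper: escape the title and the href so the output HTML stays valid
    return '<a href="{}{}">{}</a><br>\n'.format(
        base, escape(item['link'], quote=True), escape(item['title']))

print(make_link('https://hostname/bbs/',
                {'link': 'read.php?tid=1&page=2', 'title': 'R&B Collection'}))
# -> <a href="https://hostname/bbs/read.php?tid=1&amp;page=2">R&amp;B Collection</a><br>
```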
# Run the crawl
for url in urls:
    current = getAlbumInfo(url)
    dataProcessAndWriteToFile(current)
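A possible refinement (not in the tutorial's code): since the script hits the same host once per page, a `requests.Session` reuses one TCP connection across requests and carries the headers automatically, instead of passing `headers=` on every call. A sketch:

```python
import requests

# Reuse one connection and one header set for every page request
session = requests.Session()
session.headers.update({
    'User-Agent': '',  # placeholder: copy from your browser, as with the headers dict above
    'Cookie': ''       # placeholder
})

def get_page(url):
    resp = session.get(url, timeout=10)
    resp.raise_for_status()  # fail loudly on HTTP errors instead of parsing an error page
    return resp.text
```

Inside `getAlbumInfo`, `requests.get(url, headers=headers)` would then become `get_page(url)`.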