This is my second homework assignment for the Python hands-on course. This time I learned how to study by looking things up in the documentation.
Final result

Scraped product information result.png
My code
__author__ = 'Reborn'

import re

from bs4 import BeautifulSoup

# Parse the local copy of the page with the lxml parser.
with open(r"E:/study/Workspaces/pycharm/source/1_2answer_of_homework/index.html", 'r') as ht_data:
    Soup = BeautifulSoup(ht_data, 'lxml')
picadds = Soup.select('body > div > div > div.col-md-9 > div > div > div > img')
titles = Soup.select('body > div > div > div.col-md-9 > div > div > div > div.caption > h4 > a')
prices = Soup.select('body > div > div > div.col-md-9 > div > div > div > div.caption > h4.pull-right')
rates = Soup.select('body > div > div > div.col-md-9 > div > div > div > div.ratings > p > span')
nums = Soup.select('body > div > div > div.col-md-9 > div > div > div > div.ratings > p.pull-right')
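# Convert every rating <span> to a symbol; a span whose markup contains
# "empty" is an unfilled star. Start at index 0 so the first star is kept.
star = []
for index in range(len(rates)):
    string = str(rates[index])
    if re.search("empty", string) is not None:
        star.append('☆')
    else:
        star.append('★')

# Walk the parallel lists. The code assumes five rating spans per product,
# so flag is the offset of the current product's stars in the flat list.
flag = 0
for picadd, title, price, num in zip(picadds, titles, prices, nums):
    data = {
        'picadd': picadd.get("src"),
        'title': title.get_text(),
        'price': price.get_text(),
        'star': ''.join(star[flag:flag + 5]),
        'num': num.get_text()
    }
    flag += 5
    print(data)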
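
The star handling above can also be tried in isolation, without parsing any HTML. This is only a minimal sketch, assuming each product has exactly five rating spans and that an unfilled star has the word "empty" somewhere in its class attribute. The function name `stars_per_product` and the sample class strings are hypothetical and not taken from the homework page.

```python
def stars_per_product(span_classes, per_product=5):
    """Turn a flat list of rating-span class strings into one star string per product."""
    # An "empty" marker in the class string means the star is not filled.
    symbols = ['☆' if 'empty' in cls else '★' for cls in span_classes]
    # Slice the flat list into consecutive groups of per_product symbols.
    return [
        ''.join(symbols[start:start + per_product])
        for start in range(0, len(symbols), per_product)
    ]


# Hypothetical class strings for two products: 4 of 5 stars, then 2 of 5 stars.
sample = (
    ['glyphicon glyphicon-star'] * 4 + ['glyphicon glyphicon-star-empty'] * 1
    + ['glyphicon glyphicon-star'] * 2 + ['glyphicon glyphicon-star-empty'] * 3
)
result = stars_per_product(sample)
print(result)  # ['★★★★☆', '★★☆☆☆']
```

Stepping through the list in strides of five replaces the manual `flag` counter, so the slice width and the step cannot drift apart.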
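My reflections
- Only now have I learned to actually use the documentation and get real value out of it
- My fundamentals are weak; I need to go through the textbook a few more times and read more code
- Keep thinking independently and solve problems on my own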