爬虫：淘宝价格

首页 > 代码库 > 爬虫：淘宝价格

2024-10-10 01:03:01 208人阅读

 1 import requests
 2 import re
 3 
 4 def getHTMLText(url):
 5     try:
 6         r = requests.get(url, timeout = 30)
 7         r.raise_for_status()
 8         r.encoding = r.apparent_encoding
 9         return r.text
10     except:
11         return "error_1"
12 
13 def parsePage(ilt, html):
14     try:
15         plt = re.findall(r‘\"view_price\"\:\"[\d\.]*\"‘, html)
16         tlt = re.findall(r‘\"raw_title\"\:\".*?\"‘, html)
17         for i in range(len(plt)):
18             price = eval(plt[i].split(‘:‘)[1])
19             title = eval(tlt[i].split(‘:‘)[1])
20             ilt.append([price, title])
21     except:
22         print("error_2")
23 
24 def printGoodsList(ilt):
25     tplt = "{:4}\t{:8}\t{:16}"
26     print(tplt.format("序号", "价格", "商品名称"))
27     count = 0
28     for g in ilt:
29         count = count + 1
30         print(tplt.format(count, g[0], g[1]))
31 
32 def main():
33     goods = ‘书包‘
34     depth = 2
35     start_url = ‘https://s.taobao.com/search?q=‘ + goods
36     infoList = []
37     for i in range(depth):
38         try:
39             url = start_url + ‘&s=‘ + str(44*i)
40             html = getHTMLText(url)
41             parsePage(infoList, html)
42         except:
43             continue
44     printGoodsList(infoList)
45 
46 main()

爬虫：淘宝价格

声明：以上内容来自用户投稿及互联网公开渠道收集整理发布，本网站不拥有所有权，未作人工编辑处理，也不承担相关法律责任，若内容有误或涉及侵权可进行投诉：投诉/举报工作人员会在5个工作日内联系你，一经查实，本站将立刻删除涉嫌侵权内容。

联系
我们

首页 > 代码库 > 爬虫：淘宝价格

爬虫：淘宝价格

看完仍有疑问？有类似问题直接问程序猿