【发布时间】:2022-01-18 21:55:26
【问题描述】:
我正在制作我的第一个小型网络抓取程序。我正在尝试获取产品的价格,但 soup.find 返回“无”。
import requests
from bs4 import BeautifulSoup
site = 'https://www.pichau.com.br/placa-de-video-asus-geforce-gtx-1650-dual-4gb-gddr5-128-bit-dual-gtx1650-4g'
headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/96.0.4664.110 Safari/537.36 OPR/82.0.4227.50'}
page = requests.get(site, headers=headers)
soup = BeautifulSoup(page.content, 'html.parser')
price = soup.find(class_ = 'jss237')
print(price)
但是,如果我得到覆盖整个事物的盒子的类,这将返回 None,就像这样
price = soup.find(class_ = 'MuiGrid-root MuiGrid-item MuiGrid-grid-xs-12 MuiGrid-grid-sm-5').get_text()
它返回所有内容,包括我想要获得的价格
Placa de Video Asus GeForce GTX 1650 Dual, 4GB, GDDR5, 128-bit, DUAL-GTX1650-4G...SKU: DUAL-GTX1650-4Gà vistaR$1.989,00no PIX com 12% descontoR$ 2.260,23em até 12x de 188,35sem juros no cartão CaracterísticasGarantia: 12 Meses
【问题讨论】:
-
您的预期输出/目标是什么?您只希望退回价格吗?
标签: python web-scraping beautifulsoup