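【Posted】: 2020-04-14 00:42:40
【Question】:
I want to scrape a project's pledged amount from the following Kickstarter pages. I use the same approach for both, but for one of them the code returns no value.
The output of this code is an empty list: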
import requests
from lxml import html
url = 'https://www.kickstarter.com/projects/scratchideas/loki-the-ultra-portable-modular-and-robust-camera?ref=category'
page = requests.get(url=url)
tree = html.fromstring(page.content)
pledged = tree.xpath('//*[@id="react-project-header"]/div/div/div[3]/div/div[2]/div[1]/div[2]/span[1]/span/text()')
print("pledged: {}".format(pledged))
But the following code returns the project's actual pledged amount:
url = 'https://www.kickstarter.com/projects/254683764/avoseedo-grow-your-own-avocodo-tree-with-ease'
page = requests.get(url=url)
tree = html.fromstring(page.content)
pledged = tree.xpath('//*[@id="content-wrap"]/div[2]/section[1]/div/div/div/div[1]/div/div[2]/div[2]/div[1]/h3/span/text()')
print("pledged: {}".format(pledged))
So what is the difference between the two pages, and why does the first XPath return nothing?
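One way to narrow this down is to look at what the HTML returned by requests actually contains under each container id. The id react-project-header hints that the first page's header may be filled in by JavaScript in the browser, in which case the nodes the long XPath targets would not exist in the static response. That is only an assumption, not something confirmed here. The helper below is a minimal sketch for checking it; the name inspect_container is hypothetical, and it relies only on lxml's html.fromstring and xpath calls.

```python
from lxml import html


def inspect_container(page_source, container_id):
    """Report what the static HTML holds under the element with this id."""
    tree = html.fromstring(page_source)
    # XPath variable ($cid) avoids building the expression by string formatting.
    matches = tree.xpath('//*[@id=$cid]', cid=container_id)
    if not matches:
        return {"found": False, "child_elements": 0, "attributes": []}
    node = matches[0]
    return {
        "found": True,
        # Direct element children only; zero suggests an empty placeholder.
        "child_elements": len(node.xpath('./*')),
        # Attribute names can show where the data lives if it is embedded.
        "attributes": sorted(node.attrib.keys()),
    }


# Possible usage with the pages from the question (needs network access):
#   import requests
#   page = requests.get(url)
#   print(inspect_container(page.content, "react-project-header"))
```

If the container is found but has no child elements, the browser-copied XPath cannot match against the raw response, and the value would have to come from wherever the page embeds its data (for example an attribute on that element or a script tag), or from a tool that executes JavaScript.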
【Discussion】:
Tags: python-3.x xpath web-scraping