【发布时间】:2019-10-06 15:19:54
【问题描述】:
我正在尝试从该网站https://www.gumtree.co.za 抓取信息,但是我不确定如何获取该属性的 URL。
这就是我所拥有的:
class GumtreeSpider(scrapy.Spider):
name = "gumtree"
start_urls = ['https://www.gumtree.co.za/s-house-rentals-flat-rentals-offered/cape-town/v1c9071l3100006p1',
'https://www.gumtree.co.za/s-houses-flats-for-sale/cape-town/v1c9074l3100006p1']
def parse(self, response):
for prop in response.css('div.tileV1'):
link = 'https://www.gumtree.co.za' + prop.css('div.title a.tile-title-text::attr(href)').get()
我尝试了多种组合,但似乎无法正确使用。有什么建议么? 谢谢!
【问题讨论】:
标签: python xpath web-scraping scrapy web-crawler