【Posted】: 2021-01-18 04:55:24
【Question】:
import scrapy
from ..items import MatchesfashionItem


class SpiderSpider(scrapy.Spider):
    name = 'spider'
    start_urls = ['https://www.matchesfashion.com/intl/mens/shop/shoes']

    def parse(self, response):
        items = MatchesfashionItem()
        Name = response.css('.lister__item__details::text').extract()
        Brand = response.css('.lister__item__title::text').extract()
        Price = response.css('.lister__item__price-full::text').extract()
        Image_Url = response.css('div.lister__item__image productView').css('a').css('img::attr(src)')
        Product_Url = response.css('.lister__item__inner').css('div.lister__item__image').css('a::attr(href)').extract()
        items['Name'] = Name
        items['Brand'] = Brand
        items['Price'] = Price
        items['Image_Url'] = Image_Url
        items['Product_Url'] = Product_Url
        yield items
【Discussion】:
- Maybe it's because .extract() is missing at the end?
Tags: python web-scraping scrapy