一、使用scrapy创建一个项目
这里使用scrapy官方第一个示例
scrapy startproject tutorial
使用PyCharm打开项目,在tutorial/tutorial/spiders目录下创建quotes_spider.py文件并写入,以下代码
import scrapy class QuotesSpider(scrapy.Spider): name = "quotes" def start_requests(self): urls = [ 'http://quotes.toscrape.com/page/1/', 'http://quotes.toscrape.com/page/2/', ] for url in urls: yield scrapy.Request(url=url, callback=self.parse) def parse(self, response): page = response.url.split("/")[-2] filename = 'quotes-%s.html' % page with open(filename, 'wb') as f: f.write(response.body) self.log('Saved file %s' % filename)