官网:https://github.com/scrapy-plugins/scrapy-splash

1、安装:

pip install scrapy-splash

2、运行splash

docker run -p 8050:8050 scrapinghub/splash

3、配置setting文件

DOWNLOADER_MIDDLEWARES = {
    'scrapy_splash.SplashCookiesMiddleware': 723,
    'scrapy_splash.SplashMiddleware': 725,
    'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware': 810,
}

SPIDER_MIDDLEWARES = {
    'scrapy_splash.SplashDeduplicateArgsMiddleware': 100,
}

SPLASH_URL = 'http://10.63.32.49:8050/'

# 过滤
DUPEFILTER_CLASS = 'scrapy_splash.SplashAwareDupeFilter'

# 缓存
HTTPCACHE_STORAGE = 'scrapy_splash.SplashAwareFSCacheStorage'

 

 

相关文章:

  • 2021-09-19
  • 2021-07-04
  • 2021-12-10
  • 2021-04-15
  • 2022-12-23
  • 2022-12-23
  • 2021-09-03
猜你喜欢
  • 2019-12-16
  • 2021-04-28
  • 2022-12-23
  • 2022-02-21
  • 2021-08-12
  • 2022-12-23
  • 2017-12-11
相关资源
相似解决方案