【发布时间】:2020-03-17 20:27:26
【问题描述】:
class AmazonspiderSpider(scrapy.Spider):
start_urls = ['https://www.amazon.co.uk/s?k=9780297833697']
def parse(self, response):
Items = AmazonappItem()
soup = BeautifulSoup(response.text, 'lxml')
book = soup.find("a", {"class": "a-link-normal a-text-normal"})
link = book.get('href')
myurl = "https://www.amazon.co.uk" + link
Items['bookurl'] = myurl
我在 myurl 中找到了新链接,现在我需要关注这个新链接。怎么做?
【问题讨论】:
标签: python web-scraping beautifulsoup scrapy