【问题标题】:Webscrape Beautifulsoup on website (get multiple hrefs)Web Scrape Beautifulsoup 在网站上(获取多个 ahrefs)
【发布时间】:2022-08-22 15:46:29
【问题描述】:

我想从网上抓取这个网页(carbuzz.com)。我想获取从“Acura”到“Volvo”(link to picture)的所有汽车品牌的链接(href)。

目前,我只得到第一个条目(Acura)。我如何获得剩余的?当我刚开始抓取和编码时,非常感谢您的输入!

代码:

from bs4 import BeautifulSoup
import requests
import time

#Inputs/URLs to scrape:
URL2 = (\'https://carbuzz.com/cars\')
(response := requests.get(URL2)).raise_for_status()
soup = BeautifulSoup(response.text, \'lxml\')
overview = soup.find()

car_brand = overview.find(class_=\'bg-make-preview\')[\'href\']
car_brand_url =\'https://carbuzz.com\'+car_brand

print(car_brand_url)

输出:

[Finished in 1.2s]

    标签: python html web-scraping beautifulsoup


    【解决方案1】:

    您可以使用find_all 来获取类名bg-make-preview 的标签。

    soup = BeautifulSoup(response.text, 'lxml')
    
    for elem in soup.find_all(class_='bg-make-preview'):
        car_brand_url ='https://carbuzz.com' + elem['href']
        print(car_brand_url)
    

    这给了我们预期的输出

    https://carbuzz.com/cars/acura
    https://carbuzz.com/cars/alfa-romeo
    https://carbuzz.com/cars/aston-martin
    https://carbuzz.com/cars/audi
    https://carbuzz.com/cars/bentley
    https://carbuzz.com/cars/bmw
    https://carbuzz.com/cars/bollinger
    https://carbuzz.com/cars/bugatti
    https://carbuzz.com/cars/buick
    https://carbuzz.com/cars/cadillac
    https://carbuzz.com/cars/caterham
    https://carbuzz.com/cars/chevrolet
    https://carbuzz.com/cars/chrysler
    https://carbuzz.com/cars/dodge
    https://carbuzz.com/cars/ferrari
    https://carbuzz.com/cars/fiat
    https://carbuzz.com/cars/fisker
    https://carbuzz.com/cars/ford
    https://carbuzz.com/cars/genesis
    https://carbuzz.com/cars/gmc
    https://carbuzz.com/cars/hennessey
    https://carbuzz.com/cars/honda
    https://carbuzz.com/cars/hyundai
    https://carbuzz.com/cars/infiniti
    https://carbuzz.com/cars/jaguar
    https://carbuzz.com/cars/jeep
    https://carbuzz.com/cars/karma
    https://carbuzz.com/cars/kia
    https://carbuzz.com/cars/koenigsegg
    https://carbuzz.com/cars/lamborghini
    https://carbuzz.com/cars/land-rover
    https://carbuzz.com/cars/lexus
    https://carbuzz.com/cars/lincoln
    https://carbuzz.com/cars/lordstown
    https://carbuzz.com/cars/lotus
    https://carbuzz.com/cars/lucid
    https://carbuzz.com/cars/maserati
    https://carbuzz.com/cars/mazda
    https://carbuzz.com/cars/mclaren
    https://carbuzz.com/cars/mercedes-benz
    https://carbuzz.com/cars/mini
    https://carbuzz.com/cars/mitsubishi
    https://carbuzz.com/cars/nissan
    https://carbuzz.com/cars/pagani
    https://carbuzz.com/cars/polestar
    https://carbuzz.com/cars/porsche
    https://carbuzz.com/cars/ram
    https://carbuzz.com/cars/rimac
    https://carbuzz.com/cars/rivian
    https://carbuzz.com/cars/rolls-royce
    https://carbuzz.com/cars/spyker
    https://carbuzz.com/cars/subaru
    https://carbuzz.com/cars/tesla
    https://carbuzz.com/cars/toyota
    https://carbuzz.com/cars/volkswagen
    https://carbuzz.com/cars/volvo
    https://carbuzz.com/cars/hummer
    https://carbuzz.com/cars/maybach
    https://carbuzz.com/cars/mercury
    https://carbuzz.com/cars/pontiac
    https://carbuzz.com/cars/saab
    https://carbuzz.com/cars/saturn
    https://carbuzz.com/cars/scion
    https://carbuzz.com/cars/smart
    https://carbuzz.com/cars/suzuki
    

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 2020-10-09
      • 2020-06-27
      • 1970-01-01
      • 1970-01-01
      • 2018-05-13
      • 1970-01-01
      • 2022-08-04
      • 1970-01-01
      相关资源
      最近更新 更多