【发布时间】:2021-03-30 12:55:44
【问题描述】:
我需要在<p> 元素和class='fs-s fw-light pl-4 pr-4' 中为网站上的每个房屋打印文本。我尝试这种方式,但它立即开始和结束。
import gspread
import requests
import datetime
from bs4 import BeautifulSoup
from datetime import timedelta
datetime.datetime.now()
#zaklad pro scraping webu
URL = 'https://www.mmreality.cz/nemovitosti/prodej/byty/praha-vychod/?query=bcw9DoAgDIbh2zgjiro4eQzCQBB%2FErUEigm3F3Ss69Ovr4HzhEsG1BiDkkyNdWU%2B0wb32xZYLaxeuy3J2S46HjhBvNCn9%2BYhupAfJSZnvwRnjHqtRiEo8zL%2FyTR53lFuMw%2BUReaeclfi%2FAE%3D'
page = requests.get(URL)
soup = BeautifulSoup(page.content, 'html.parser')
#Scraping
results = soup.find_all('id', attrs={'class':'cell xsmall-24 mb-4 grid-y pos-relative rounded pseudo-border pseudo-border-primary-lightest pseudo-border-on-hover-none pseudo-box-shadow-xlarge-on-hover medium-12 xlarge-8'})
for job_data in results:
nemovitost = job_data.find('p', attrs={'class':'fs-s fw-light pl-4 pr-4'})
nemovitost_final = nemovitost.text.strip()
print(nemovitost_final)
我使用for job_data in results 尝试此代码 - 用于 id,因为它是我要打印的组元素的标题。在元素<p> 之后我想打印<strong class="block text-secondary fs-xxl pl-4 pr-4 mb-4">6 800 000 Kč</strong> 等。
我尝试从这个网站打印文本:www.mmreality.cz
【问题讨论】:
标签: python-3.x web-scraping python-requests