【发布时间】:2019-01-20 00:48:21
【问题描述】:
https://www.ptv.vic.gov.au/next5/diva/10018306/line/9777/2
我正在尝试获取时间/时间(出发时间)和目的地,但页面每 60 秒刷新一次,我无法获取该信息。
这是我迄今为止尝试过的:
from bs4 import BeautifulSoup
import requests
from user_agent import generate_user_agent
from requests import get
headers = {'User-Agent': generate_user_agent(device_type="desktop", os=('mac', 'linux'))}
url = 'https://www.ptv.vic.gov.au/next5/diva/10004556/line/11613/2'
response = get(url)
html_soup = BeautifulSoup(response.text, 'html.parser')
type(html_soup)
datatest = html_soup.find_all('div', class_='timetable')
print(type(datatest))
print(len(datatest))
我想从网站上获取至少 3 个即将到来的时间和目的地。
【问题讨论】:
标签: python python-3.x web-scraping beautifulsoup python-requests