【发布时间】:2020-11-03 17:59:28
【问题描述】:
我正在尝试从包含当前利率的网站上抓取表格。我用 python 和漂亮的汤,但我找不到 html 部分。请发送帮助!谢谢。
我只需要抓取当前利率表,而不是其他所有内容并将其转换为 csv 文件。这是我网站的链接:https://www.global-rates.com/en/interest-rates/libor/american-dollar/usd-libor-interest-rate-12-months.aspx 这是当前利率表的图片:
我尝试过这样的事情:
import bs4
import requests
from bs4 import BeautifulSoup
import pandas as pd
URL = 'https://www.global-rates.com/en/interest-rates/libor/american-dollar/usd-libor-interest-rate-12-months.aspx'
response = requests.get(URL)
soup=bs4.BeautifulSoup(response.content, 'html.parser')
print(soup.title)
print(soup.title.string)
print(len(response.text))
table = soup.find('table', attrs = {'class':'tableheader'}).tbody
print(table)
columns = ['Current interest rates']
df = pd.DataFrame(columns = columns)
trs = table.find_all('tr')
for tr in trs:
tds = tr.find_all('td')
row = [td.text.replace('\n', '') for td in tds]
df = df.append(pd.Series(row, index = columns), ignore_index = True)
df.to_csv('libor.csv', index = False)
但这给了我属性错误:“None Type'对象没有属性'tbody'
哦,如果可能的话,我还想自动抓取星期一的利率。 谢谢你的帮助
【问题讨论】:
标签: python-3.x web-scraping beautifulsoup