【发布时间】:2019-03-13 05:00:47
【问题描述】:
我有一小段代码可以从网站上抓取表格数据,然后以 csv 格式显示。问题是 for 循环多次打印记录。我不确定这是否是由于
标签。顺便说一句,我是 Python 新手。感谢您的帮助!
#import needed libraries
import urllib
from bs4 import BeautifulSoup
import requests
import pandas as pd
import csv
import sys
import re
# read the data from a URL
url = requests.get("https://www.top500.org/list/2018/06/")
# parse the URL using Beauriful Soup
soup = BeautifulSoup(url.content, 'html.parser')
newtxt= ""
for record in soup.find_all('tr'):
tbltxt = ""
for data in record.find_all('td'):
tbltxt = tbltxt + "," + data.text
newtxt= newtxt+ "\n" + tbltxt[1:]
print(newtxt)
【问题讨论】:
标签: python beautifulsoup