【Posted】: 2020-04-23 10:55:30
【Problem description】:
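How do I combine the complete lists into a DataFrame? When I print the result, it seems to show only the first record, and the text also contains \n and other clutter such as ' characters.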
import requests
from requests_html import HTML, HTMLSession
from bs4 import BeautifulSoup
import pandas as pd
import csv
import json

url = 'https://lehighsports.com/sports/mens-soccer/schedule/2018'
lehigh = requests.get(url).text
soup = BeautifulSoup(lehigh, 'lxml')

for opp in soup.find_all('div', class_="sidearm-schedule-game-opponent-text"):
    opp_list = []
    opp_list.append(opp.text)
# print(opp_list)

for conf in soup.find_all('div', class_="sidearm-schedule-game-conference-conference"):
    conf_list = []
    conf_list.append(conf.text)
# print(conf_list)

dict = {'opponent': [opp_list], 'conference': [conf_list]}
df = pd.DataFrame(dict)
print(df)
【Discussion】:
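Three things in the posted code likely explain the output: each list is re-created inside its loop, so only one element survives; `[opp_list]` wraps the list in a second list, so the DataFrame gets a single row whose cell holds a Python list (hence the brackets and quote marks); and `.text` returns the raw text including newlines. The following is a minimal sketch of one way around this, not a definitive fix. `SAMPLE_HTML`, `clean`, and `schedule_frame` are hypothetical names, the team and conference values are made up, and `html.parser` is swapped in for `lxml` only so the example needs no extra parser. It also assumes the two class lists line up row for row, which may not hold on the real page.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the live schedule page;
# the class names come from the question, the values are made up.
SAMPLE_HTML = """
<div class="sidearm-schedule-game-opponent-text">
    <a href="#">Team A</a>
</div>
<div class="sidearm-schedule-game-conference-conference">
    Conference X
</div>
<div class="sidearm-schedule-game-opponent-text">
    <a href="#">Team B</a>
</div>
<div class="sidearm-schedule-game-conference-conference">
    Conference Y
</div>
"""


def clean(tag):
    """Return the tag's text with newlines and repeated spaces collapsed."""
    return " ".join(tag.get_text(" ", strip=True).split())


def schedule_frame(html):
    """Build a DataFrame with one row per matched element."""
    soup = BeautifulSoup(html, "html.parser")
    # Comprehensions build each list once, instead of resetting it per iteration.
    opp_list = [
        clean(t)
        for t in soup.find_all("div", class_="sidearm-schedule-game-opponent-text")
    ]
    conf_list = [
        clean(t)
        for t in soup.find_all("div", class_="sidearm-schedule-game-conference-conference")
    ]
    # Pass the lists directly (not wrapped in another list) so each element
    # becomes its own row. Wrapping them in Series pads the shorter column
    # with NaN if the lengths differ, rather than raising an error.
    return pd.DataFrame({
        "opponent": pd.Series(opp_list, dtype="object"),
        "conference": pd.Series(conf_list, dtype="object"),
    })


df = schedule_frame(SAMPLE_HTML)
print(df)
```

For the real page, the same function could be called with `requests.get(url).text` in place of `SAMPLE_HTML`. If some games have no conference element, pairing by position will misalign the rows; iterating over each game's container element and looking up both fields inside it would be a safer design.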
Tags: python-3.x beautifulsoup python-requests removing-whitespace