【发布时间】:2021-09-14 20:14:25
【问题描述】:
我正在将 nfl 数据抓取到数据框中,但输出是将每个团队放入每个团队的单独数据框中。我怎样才能得到它,以便输出将每一行放入所有相同的 DF 中?
team_abbreviations = ['buf','mia','nwe','nyj','htx','clt','jax','oti','cin','pit',
'cle','rav','den','kan','rai','sdg','phi','dal','nyg','was',
'car','nor','tam','atl','chi','det','gnb','min','crd','ram','sea','sfo']
for i in team_abbreviations:
url = 'https://www.pro-football-reference.com/teams/{0}/2021/gamelog/'.format(i)
data = pd.read_html(url)[0]
data.rename(columns={'Unnamed: 3_level_1': 'Box_link'}, inplace=True)
data.rename(columns={'Unnamed: 4_level_1': 'W/L'}, inplace=True)
data.rename(columns={'Unnamed: 6_level_1': '@'}, inplace=True)
data.rename(columns={'Unnamed: 3_level_1': 'Box_link'}, inplace=True)
data.columns = ['Week', 'Day', 'Date', 'Box_link', 'W/L', 'OT', 'Home/Away', 'Opp_Team', 'Tm_Score',
'Opp_Score', 'PassCmp', 'PassAtt', 'PassYds', 'PassTD', 'Int', 'Sk', 'YdsLost_Sk', 'PassY/A',
'PassNY/A', 'Cmp%', 'PasserRate', 'RushAtt', 'RushYds', 'RushY/A', 'RushTD', 'FGM', 'FGA', 'XPM',
'XPA', 'Pnt', 'PntYds', '3DConv', '3DAtt', '4DConv', '4DAtt', 'ToP']
data = data[pd.notnull(data['Opp_Score'])]
print(data)
【问题讨论】:
标签: python pandas dataframe screen-scraping