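【Posted】: 2016-04-24 20:08:58
【Question】:
I am trying to import a CSV file with pandas.read_csv. The header row has three fields, but every data row has four. The file looks like this: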
"COL_A","COL_B","COL_C"
"ROW1COLA","ROW1COLB","ROW1COLC","ROW1COLD"
"ROW2COLA","ROW2COLB","ROW2COLC","ROW2COLD"
"ROW3COLA","ROW3COLB","ROW3COLC","ROW3COLD"
"ROW4COLA","ROW4COLB","ROW4COLC","ROW4COLD"
"ROW5COLA","ROW5COLB","ROW5COLC","ROW5COLD"
"ROW6COLA","ROW6COLB","ROW6COLC","ROW6COLD"
"ROW7COLA","ROW7COLB","ROW7COLC","ROW7COLD"
On my first attempt I ran:
data = pd.read_csv('broken.csv')
and got:
             COL_A     COL_B     COL_C
ROW1COLA  ROW1COLB  ROW1COLC  ROW1COLD
ROW2COLA  ROW2COLB  ROW2COLC  ROW2COLD
ROW3COLA  ROW3COLB  ROW3COLC  ROW3COLD
ROW4COLA  ROW4COLB  ROW4COLC  ROW4COLD
ROW5COLA  ROW5COLB  ROW5COLC  ROW5COLD
ROW6COLA  ROW6COLB  ROW6COLC  ROW6COLD
ROW7COLA  ROW7COLB  ROW7COLC  ROW7COLD
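The shifted header follows from how read_csv handles a header that is one field shorter than the data rows: the first column is used as the index and the header names are applied to the remaining columns. A minimal sketch that reproduces this without a file on disk, assuming the first rows of the file are inlined as a string (the name `CSV_TEXT` is only for illustration):

```python
import io

import pandas as pd

# First lines of broken.csv, inlined so the example is self-contained.
CSV_TEXT = (
    '"COL_A","COL_B","COL_C"\n'
    '"ROW1COLA","ROW1COLB","ROW1COLC","ROW1COLD"\n'
    '"ROW2COLA","ROW2COLB","ROW2COLC","ROW2COLD"\n'
)

data = pd.read_csv(io.StringIO(CSV_TEXT))

# The first field of each row ends up in the index, not in COL_A.
print(data.index.tolist())
# The three header names label the second, third and fourth fields.
print(data.columns.tolist())
print(data["COL_A"].tolist())
```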
Setting index_col=False:
data = pd.read_csv('broken.csv',index_col=False)
I get the following, with the fourth column dropped:
      COL_A     COL_B     COL_C
0  ROW1COLA  ROW1COLB  ROW1COLC
1  ROW2COLA  ROW2COLB  ROW2COLC
2  ROW3COLA  ROW3COLB  ROW3COLC
3  ROW4COLA  ROW4COLB  ROW4COLC
4  ROW5COLA  ROW5COLB  ROW5COLC
5  ROW6COLA  ROW6COLB  ROW6COLC
6  ROW7COLA  ROW7COLB  ROW7COLC
If I add prefix='X':
data = pd.read_csv('broken.csv',index_col=False,prefix='X')
I get the same result:
      COL_A     COL_B     COL_C
0  ROW1COLA  ROW1COLB  ROW1COLC
1  ROW2COLA  ROW2COLB  ROW2COLC
2  ROW3COLA  ROW3COLB  ROW3COLC
3  ROW4COLA  ROW4COLB  ROW4COLC
4  ROW5COLA  ROW5COLB  ROW5COLC
5  ROW6COLA  ROW6COLB  ROW6COLC
6  ROW7COLA  ROW7COLB  ROW7COLC
The same happens with read_table:
data = pd.read_table('broken.csv',index_col=True,sep=',')
Is there any way for pandas to assign names to the missing header columns automatically, so that the values in those columns are kept?
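One possible direction, offered as a sketch rather than a built-in pandas feature: count the fields yourself, pad the header with generated names, and pass the full list through `names=` while skipping the original header row with `header=0`. The helper `read_with_padded_header`, the inlined `CSV_TEXT`, and the `X<position>` naming scheme are all assumptions made for illustration. Note that the documented purpose of `prefix` is to name columns when there is no header at all, which would explain why it had no effect above.

```python
import csv
import io

import pandas as pd

# First lines of broken.csv, inlined so the example is self-contained.
CSV_TEXT = (
    '"COL_A","COL_B","COL_C"\n'
    '"ROW1COLA","ROW1COLB","ROW1COLC","ROW1COLD"\n'
    '"ROW2COLA","ROW2COLB","ROW2COLC","ROW2COLD"\n'
)


def read_with_padded_header(text, prefix="X"):
    """Hypothetical helper: pad a short header so no data column is lost."""
    rows = list(csv.reader(io.StringIO(text)))
    header = rows[0]
    # Widest row in the file decides how many column names are needed.
    width = max(len(row) for row in rows)
    # Keep the real names and invent one per missing column, e.g. X3.
    names = header + [f"{prefix}{i}" for i in range(len(header), width)]
    # header=0 discards the original header line; names= replaces it.
    return pd.read_csv(io.StringIO(text), names=names, header=0)


df = read_with_padded_header(CSV_TEXT)
print(df.columns.tolist())
print(df)
```

For a file on disk the same idea applies by reading the file once with the csv module and then passing the path to read_csv. This reads the data twice, which may matter for large files.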
【Discussion】: