【发布时间】:2015-06-22 21:21:40
【问题描述】:
我有这段代码,它通过从现有列中提取信息来操作数据集以创建新列。为了使用 pd.merge 函数与另一个数据集正确匹配数据,我想将“通道 ID”列转换为整数。尽管当前使用 .astype(int),但结果数据类型显示为 float64,查看带有 .info() 的帧
def cost(received_frame):
received_frame.columns = ['Campaign', 'Ad Spend']
campaigns = received_frame['Campaign']
ID = []
for c in campaigns:
blocks = re.split('_', c)
for block in blocks[1:]:
if len(block) == 6 and block.isdigit():
ID.append(block)
ID = pd.Series(ID).str.replace("'","")
ID = pd.DataFrame(ID)
both = [ID,received_frame]
frame = pd.concat(both,axis=1)
frame.columns = ['Channel ID', 'Campaign', 'Ad Spend']
frame['Channel ID'] = frame['Channel ID'].dropna().astype(int)
return frame
【问题讨论】:
-
如果您能分享您正在处理的数据,将会很有帮助。