【发布时间】:2021-06-07 00:24:06
【问题描述】:
我有一个数据框
df = pd.DataFrame([["A",1,98,88,"",567,453,545,656,323,756], ["B",1,99,"","",231,232,234,943,474,345], ["C",1,97,67,23,543,458,456,876,935,876], ["B",1,"",79,84,895,237,678,452,545,453], ["A",1,45,"",58,334,778,234,983,858,657], ["C",1,23,55,"",183,565,953,565,234,234]], columns=["id","date","col1","col2","col3","col1_num","col1_deno","col3_num","col3_deno","col2_num","col2_deno"])
我需要为列名的 _num 和 _deno 分别设置 Nan/blank 值。例如:如果 "col1" 的特定行为空白,则将 "col1_num" 和 "col1_deno" 的值设为 Nan/blank。基于 "col2" 对 "col2_num" 和 "col2_deno" 以及 "col3_num" 重复相同的过程> 和 "col3_deno" 基于 "col3"。
预期输出:
df_out = pd.DataFrame([["A",1,98,88,"",567,453,"","",323,756], ["B",1,99,"","",231,232,"","","",""], ["C",1,97,67,23,543,458,456,876,935,876], ["B",1,"",79,84,"","",678,452,545,453], ["A",1,45,"",58,334,778,234,983,"",""], ["C",1,23,55,"",183,565,"","",234,234]], columns=["id","date","col1","col2","col3","col1_num","col1_deno","col3_num","col3_deno","col2_num","col2_deno"])
怎么做?
【问题讨论】:
标签: python python-3.x pandas python-2.7 dataframe