【发布时间】:2018-04-06 10:34:41
【问题描述】:
我有一个 df,
Name Step Description
Ram 1 Ram is oNe of the good cricketer
Ram 2 gopal one
Sri 1 Sri is one of the member
Sri 2 ravi good
Kumar 1 Kumar is a keeper
Madhu 1 good boy
Vignesh 1 oNe little
Pechi 1 one book
mario 1 good randokm
Roger 1 one milita good
bala 1 looks good
raj 1 more one
venk 1 likes good
还有一个列表,
my_list=["one","good"]
我正在尝试从 my_list 中获取至少包含一个关键字的行。
我试过了, mask=df["描述"].str.contains("|".join(my_list),na=False) 我正在获取 output_df,
Name Description
Ram Ram is one of the good cricketer
Sri Sri is one of the member
我还想将“描述”中存在的关键字及其计数添加到单独的列中,
当 df["Name"] 不是第一次出现时,即使“描述”包含关键字,也不应该在键列中复制关键字我想要的输出是,
my_desired 输出是,
Name Step Description keys count
Ram 1 Ram is one of the good cricketer one,good 2
Ram 2 gopal one
Sri 1 Sri is one of the member one 1
Sri 2 ravi good
Kumar 1 Kumar is a keeper
Madhu 1 good boy good 1
Vignesh 1 oNe little oNe 1
Pechi 1 one book one 1
mario 1 good randokm good good 1
Roger 1 one milita good one,good 2
bala 1 looks good good 1
raj 1 more one one 1
venk 1 likes good good 1
【问题讨论】:
标签: python pandas dataframe data-analysis