【发布时间】:2021-08-18 13:01:33
【问题描述】:
我有什么:
import pandas as pd
inp = [{'long string':'ha: (tra: 1 la: 2) \n hi: (tra: 1 la: 2) \n ho: (tra: 1 la: 2)'},
{'long string':'hi: (tra: 1 la: 2) \n ha: (tra: 1 la: 2) \n ho: (tra: 1 la: 2)'},
{'long string':'ho: (tra: 1 la: 2) \n hi: (tra: 1 la: 2) \n ha: (tra: 1 la: 2)'}]
df = pd.DataFrame(inp)
df
给予
long string
0 ha: (tra: 1 la: 2) \n hi: (tra: 1 la: 2) \n ho...
1 hi: (tra: 1 la: 2) \n ha: (tra: 1 la: 2) \n ho...
2 ho: (tra: 1 la: 2) \n hi: (tra: 1 la: 2) \n ha...
我想要什么
inp = {'ha-tra':['1', '1', '1'], 'ha-la':['2', '2', '2'], 'hi-tra':['1', '1', '1'], 'hi-la':['2', '2', '2'],'ho-tra':['1', '1', '1'], 'ho-la':['2', '2', '2']}
df = pd.DataFrame(inp)
df
给予
ha-tra ha-la hi-tra hi-la ho-tra ho-la
0 1 2 1 2 1 2
1 1 2 1 2 1 2
2 1 2 1 2 1 2
上下文
我想从一个大字符串中获取 (ha hi ho) 和 (tra la) 的每个组合,并从字符串中获取与这些组合相关的分数。问题是(ha hi ho)的顺序不一样。
【问题讨论】:
标签: python regex pandas dataframe