【发布时间】:2022-10-13 15:41:36
【问题描述】:
我想从现有数据框创建子类别
数据框列包括(示例表)我在列级别需要的更改不是数据中的任何更改,例如一组列是列名和列名 3 个不同的后缀(很少有类似的列名和其他列名)
像这样的例子
|payer_id|payer_name|halo_payer_name|delta_payer_name|halo_desc|delta_desc|halo_operations|delta_notes|halo_processed_data|delta_processed_data|额外|insurance_company|
我希望它被分组到这个光环组 halo_payer_name|halo_desc|halo_operations|halo_processed_data|
我希望它被分组到这个增量组 delta_payer_name|delta_desc|delta_notes|delta_processed_data|
其余列为一组
所以当转换为 JSON 时,它会出现在这个布局中
{
"schema": {
"fields": [{
"payer_details": [{
"name": "payer_id",
"type": "string"
},
{
"name": "payer_name",
"type": "string"
},
{
"name": "extra",
"type": "string"
},
{
"name": "insurance_company",
"type": "string"
}
]
},
{
"halo": [{
"name": "halo_payer_name",
"type": "string"
},
{
"name": "halo_desc",
"type": "string"
},
{
"name": "halo_operstions",
"type": "string"
},
{
"name": "halo_processed_data",
"type": "string"
}
]
}, {
"delta": [{
"name": "delta_payer_name",
"type": "string"
},
{
"name": "delta_desc",
"type": "string"
},
{
"name": "delta_notes",
"type": "string"
},
{
"name": "delta_processed_data",
"type": "string"
}
]
}
],
"pandas_version": "1.4.0"
},
"masterdata": [{
"payer_details": [{
"payer_id": "",
"payer_name": "",
"extra": "",
"insurance_company": ""
}],
"halo": [{
"halo_payer_name": "",
"halo_desc": "",
"halo_operations": "",
"halo_processed_data": "",
}],
"delta":[{
"delta_payer_name": "",
"delta_desc": "",
"delta_notes": "",
"delta_processed_data": "",
}]
}]
}
对于这种情况,我找不到解决方案,因为它是基于列的分组而不是基于数据的分组
【问题讨论】: