【发布时间】:2020-06-01 07:17:39
【问题描述】:
我有一个 pandas 数据框,其 to_json(orient="records") 格式的每个条目都返回:
{
"applicationType": "IMPALA",
"user": "root",
"id": "705c64ad",
"category_2": "{ \"tag\": \"uncategorised\",\"tag_type\":\"uncategorised\" }",
"category_5": "{ \"tag\": \"HR\",\"tag_type\":\"Management\" }",
"category_8": "{ \"tag\": \"uncategorised\",\"tag_type\":\"uncategorised\" }"
}
如果使用 to_dict(orient="records") 操作 Dataframe,它将以 Dictionary 格式返回以下数据:
{
'applicationType': 'IMPALA',
'user': 'root',
'id': '705c64ad',
'category_2': '{ "tag": "uncategorised","tag_type":"uncategorised" }',
'category_5': '{ "tag": "HR","tag_type":"Management" }',
'category_8': '{ "tag": "uncategorised","tag_type":"uncategorised" }'
}
我想处理上述数据以生成一个 JSON,该 JSON 应该是一个 SET,在一个名为“category”的标签中为数据框的所有“category_*”列具有唯一条目,如下所示
{
"applicationType": "IMPALA",
"user": "root",
"id": "705c64ad",
"category": [{ "tag": "uncategorised","tag_type":"uncategorised" }, { "tag": "HR","tag_type":"Management" }]
}
【问题讨论】:
标签: python python-3.x pandas list set