【Posted】:2020-12-08 03:43:54
【Question】:
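There are many posts about unnesting repeated fields in BigQuery. For this dataset, though, I have tried just about every variation of the code I could find to flatten the data, and I cannot produce a result without creating blanks in the id field. Do I need to unnest the nested fields as well?
I am using the COVID Dimensions dataset, which is one of the public datasets. Here is some minimal code that reproduces my problem: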
SELECT
  id,
  authors
FROM
  `covid-19-dimensions-ai.data.publications`
CROSS JOIN
  UNNEST(authors)
LIMIT 1000
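And here is the JSON structure after running this query. Everything is flattened into the structure I want, but I cannot figure out how to fill in or avoid the blank id values.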
{
  "id": "pub.1130234899",
  "authors": {
    "first_name": "Eric M",
    "last_name": "Yoshida",
    "initials": null,
    "researcher_id": "ur.01071531321.03",
    "grid_ids": [
      "grid.17091.3e"
    ],
    "corresponding": false,
    "raw_affiliations": [
      "Division of Gastroenterology, University of British Columbia, Vancouver, British Columbia, Canada"
    ],
    "affiliations_address": [
      {
        "grid_id": "grid.17091.3e",
        "city_id": "6173331",
        "state_code": "CA-BC",
        "country_code": "CA",
        "raw_affiliation": "Division of Gastroenterology, University of British Columbia, Vancouver, British Columbia, Canada"
      }
    ]
  }
}
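To make the row semantics concrete, here is a small Python model of what the query does. It is a sketch under stated assumptions, not BigQuery code: the `publication` dict, the second author, and the helper names `cross_join_unnest` and `flatten_authors` are all hypothetical and exist only for illustration. It shows that a cross join against an unnested array copies id onto every output row. One possible reading, offered as an assumption rather than a confirmed diagnosis, is that the blanks come from the repeated fields still sitting inside each author (grid_ids, raw_affiliations, affiliations_address): a result viewer that prints repeated values on extra display lines would leave id empty on those lines.

```python
from typing import Any, Dict, Iterator, List

# Hypothetical in-memory stand-in for one row of the publications table.
# Field names come from the JSON sample above; the second author and its
# grid_ids are invented purely for illustration.
publication: Dict[str, Any] = {
    "id": "pub.1130234899",
    "authors": [
        {"first_name": "Eric M", "last_name": "Yoshida",
         "grid_ids": ["grid.17091.3e"]},
        {"first_name": "A", "last_name": "Example",
         "grid_ids": ["grid.0.a", "grid.0.b"]},
    ],
}


def cross_join_unnest(row: Dict[str, Any], array_field: str) -> Iterator[Dict[str, Any]]:
    """Yield one output row per array element, copying every other column."""
    for element in row[array_field]:
        out = {key: value for key, value in row.items() if key != array_field}
        out[array_field] = element
        yield out


def flatten_authors(row: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Unnest authors, then each author's grid_ids, keeping only scalar columns."""
    flat: List[Dict[str, Any]] = []
    for per_author in cross_join_unnest(row, "authors"):
        author = per_author["authors"]
        # Like a cross join, an author with no grid_ids produces no rows here.
        for grid_id in author["grid_ids"]:
            flat.append({
                "id": per_author["id"],
                "first_name": author["first_name"],
                "last_name": author["last_name"],
                "grid_id": grid_id,
            })
    return flat


rows = flatten_authors(publication)
for r in rows:
    print(r)
```

Every row in `rows` carries the same id and contains only scalar values, so nothing is left to spill onto a second display line. If the assumption above holds, the analogous step in the query would be to select individual scalar fields of each author, or to unnest the inner arrays too, instead of selecting the whole authors struct.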
【Comments】:
Tags: google-bigquery