【发布时间】:2019-04-29 18:35:47
【问题描述】:
我在 GA BigQuery 导出中计算唯一身份用户时遇到问题。我使用示例数据重现了相同的错误。
SELECT sum(users) as users, sum(sessions) as sessions FROM (
SELECT
h.page.pagePath as page_path,
trafficSource.source,
trafficSource.medium,
COUNT(DISTINCT(fullVisitorId)) AS users,
COUNT(*) as sessions
FROM
`bigquery-public-data.google_analytics_sample.ga_sessions_20170101`, UNNEST(hits) h
WHERE h.page.pagePath = "/home"
GROUP BY page_path, source, medium
)
UNION ALL
SELECT sum(users) as users, sum(sessions) as sessions FROM (
SELECT
h.page.pagePath as page_path,
COUNT(DISTINCT(fullVisitorId)) AS users,
COUNT(*) as sessions
FROM
`bigquery-public-data.google_analytics_sample.ga_sessions_20170101`, UNNEST(hits) h
WHERE h.page.pagePath = "/home"
GROUP BY page_path
)
当我包含 source 和 medium 列时,不同的 fullVisitorId 计数比没有它们时高 10。包括这些列是如何导致fullVisitorIds 数量增加的?这对我来说没有意义。
这是什么原因造成的,我该如何获得准确的计数?
【问题讨论】:
标签: google-analytics google-bigquery