【发布时间】:2018-05-02 15:05:34
【问题描述】:
我有一个 Bigquery 表中的文档列表。其中一些名称非常相似。我需要检查每一对文档,看看它们有多少相同的单词,所以我可以建议删除其中的一个。
例如:
Spreadsheets
Quality Control.xlsx
Product Structure.xlsx
Invoices Sent April.xslx
Invoices Sent March.xlsx
Total Costs April.xlsx
Total Costs March.xlsx
Process of Quality Control.xlsx`
我会得到这样的结果
Spreadsheet |Matching Spreadsheet |Words
Quality Control.xlsx |Process of Quality Control.xlsx |2
Product Structure.xlsx |null |null
Invoices Sent April.xslx |Invoices Sent March.xlsx |2
Invoices Sent March.xlsx |Invoices Sent April.xlsx |2
Total Costs April.xlsx |Total Costs March.xlsx |2
Total Costs March.xlsx |Total Costs April.xlsx |2
Process of Quality Control.xlsx |Quality Control.xlsx |2
【问题讨论】:
-
你应该提供更多细节!
-
刚刚更新了描述。希望我的问题更清楚
标签: google-bigquery string-comparison bigquery-standard-sql