【Posted】: 2020-02-07 20:45:37
【Question】:
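I'm having trouble loading CSV data into a Snowflake table. The fields are enclosed in double quotes, which causes problems when importing them into the table.
I know that COPY INTO has the CSV-specific option FIELD_OPTIONALLY_ENCLOSED_BY = '"', but it doesn't work at all.
Here are the table definition and the COPY command: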
CREATE TABLE ...
(
GamePlayId NUMBER NOT NULL,
etc...
....);
COPY INTO ...
FROM ...csv.gz'
FILE_FORMAT = (TYPE = CSV
STRIP_NULL_VALUES = TRUE
FIELD_DELIMITER = ','
SKIP_HEADER = 1
error_on_column_count_mismatch=false
FIELD_OPTIONALLY_ENCLOSED_BY = '"'
)
ON_ERROR = "ABORT_STATEMENT"
;
The CSV file looks like this:
"3922000","14733370","57256","2","3","2","2","2019-05-23 14:14:44",",00000000",",00000000",",00000000",",00000000","1000,00000000","1000,00000000","1317,50400000","1166,50000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000",",00000000"
I get this error:
Numeric value '"3922000"' is not recognized
I'm fairly sure this is because the NUMBER value is interpreted as a string when Snowflake reads the "" marks, but since I'm using
FIELD_OPTIONALLY_ENCLOSED_BY = '"'
the quotes shouldn't even be there... Does anyone have a solution for this?
【Comments】:
-
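Normally, numeric and date fields aren't quoted like this. The purpose of quoting a field is to allow the field delimiter or record delimiter to appear inside it, so there shouldn't be any reason to do it here. Can you export the file so that the numeric and datetime fields aren't quoted?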
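If re-exporting isn't possible, the file could be rewritten before loading. One caveat: values in the sample row such as "1000,00000000" use a comma as the decimal separator, so simply dropping the quotes would split them into two fields. Below is a minimal Python sketch of that preprocessing step, not a confirmed fix for the COPY INTO error. It assumes every field of the form digits,digits is a decimal-comma number, and the helper names `normalize_field` and `rewrite_csv` are made up for illustration. The sample line is an abbreviated version of the row in the question.

```python
import csv
import io
import re

# Assumption: a field like "1000,00000000" or ",00000000" is a decimal-comma number.
DECIMAL_COMMA = re.compile(r"-?\d*,\d+")


def normalize_field(value: str) -> str:
    """Turn a decimal-comma number into a decimal-point number; leave other values alone."""
    if DECIMAL_COMMA.fullmatch(value):
        sign = "-" if value.startswith("-") else ""
        integer_part, fraction = value.lstrip("-").split(",")
        # An empty integer part (",00000000") becomes an explicit leading zero.
        return f"{sign}{integer_part or '0'}.{fraction}"
    return value


def rewrite_csv(source, target) -> None:
    """Copy rows from source to target, quoting only the fields that still need it."""
    reader = csv.reader(source, delimiter=",", quotechar='"')
    writer = csv.writer(target, delimiter=",", quotechar='"',
                        quoting=csv.QUOTE_MINIMAL, lineterminator="\n")
    for row in reader:
        writer.writerow([normalize_field(field) for field in row])


# Abbreviated version of the row shown in the question.
sample = '"3922000","14733370","2019-05-23 14:14:44",",00000000","1000,00000000"\n'
out = io.StringIO()
rewrite_csv(io.StringIO(sample), out)
print(out.getvalue(), end="")
# 3922000,14733370,2019-05-23 14:14:44,0.00000000,1000.00000000
```

For a real .csv.gz file, the same function could be given handles from `gzip.open(path, "rt", newline="")` and `gzip.open(path, "wt", newline="")`. A header row passes through unchanged as long as no header name looks like a decimal-comma number.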
Tags: csv import snowflake-cloud-data-platform