【发布时间】:2012-08-02 19:54:32
【问题描述】:
我将表格存储为 SequenceFile 格式,并设置以下命令以启用带有块压缩的序列-
set mapred.output.compress=true;
set mapred.output.compression.type=BLOCK;
set mapred.output.compression.codec=org.apache.hadoop.io.compress.LzoCodec;
但是当我尝试像这样查看表格时-
describe extended lip_table
我得到以下信息,其中有一个名为compressed的字段设置为false,那么这意味着我的数据没有通过设置上述三个命令得到压缩?
Detailed Table Information Table(tableName:lip_table, dbName:default, owner:uname,
createTime:1343931235, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols:
[FieldSchema(name:buyer_id, type:bigint, comment:null), FieldSchema(name:total_chkout,
type:bigint, comment:null), FieldSchema(name:total_errpds, type:bigint, comment:null)],
location:hdfs://ares-nn/apps/hdmi/uname/lip-data,
inputFormat:org.apache.hadoop.mapred.SequenceFileInputFormat,
outputFormat:org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat,
**compressed:false**, numBuckets:-1, serdeInfo:SerDeInfo(name:null,
serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, parameters:
{serialization.format= , field.delim=
【问题讨论】:
-
尝试
describe formatted获得describe extended的漂亮打印 -
我正在运行 Hive 0.6,它不支持
describe formatted。 :(