I save the table in SequenceFile format, and I run the following commands to enable SequenceFile output with BLOCK compression:
    set mapred.output.compress=true;
    set mapred.output.compression.type=BLOCK;
    set mapred.output.compression.codec=org.apache.hadoop.io.compress.LzoCodec;
But when I looked at the table like this:
describe extended lip_table
I got the output below, which has a field called compressed that is set to false. So does that mean my data was not compressed by the three commands above?
Detailed Table Information Table(tableName:lip_table, dbName:default, owner:uname, createTime:1343931235, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols: [FieldSchema(name:buyer_id, type:bigint, comment:null), FieldSchema(name:total_chkout, type:bigint, comment:null), FieldSchema(name:total_errpds, type:bigint, comment:null)], location:hdfs://ares-nn/apps/hdmi/uname/lip-data, inputFormat:org.apache.hadoop.mapred.SequenceFileInputFormat, outputFormat:org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat, **compressed:false**, numBuckets:-1, serdeInfo:SerDeInfo(name:null, serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe, parameters: {serialization.format= , field.delim=
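
One way to check the data itself, rather than the table metadata, is to read the header of one of the SequenceFiles under the table location. A SequenceFile (version 5 or later) starts with the bytes SEQ, a version byte, the key and value class names, two boolean flags (compressed, block-compressed), and, if compressed, the codec class name. The following is a minimal Python sketch under those assumptions, not a full reader: the function names are my own, it only handles class names shorter than 128 bytes, and the file name in the usage comment is hypothetical.

```python
def read_string(stream):
    """Read a Hadoop Text-style string: a length prefix, then UTF-8 bytes.

    Simplification: only single-byte length prefixes (0..127) are handled,
    which is enough for typical class names.
    """
    first = stream.read(1)
    if len(first) != 1:
        raise ValueError("unexpected end of header")
    length = first[0]
    if length > 127:
        raise ValueError("multi-byte length prefix not handled in this sketch")
    data = stream.read(length)
    if len(data) != length:
        raise ValueError("unexpected end of header")
    return data.decode("utf-8")


def read_sequencefile_header(stream):
    """Return the compression-related fields of a SequenceFile header."""
    if stream.read(3) != b"SEQ":
        raise ValueError("not a SequenceFile")
    version_byte = stream.read(1)
    if len(version_byte) != 1:
        raise ValueError("unexpected end of header")
    version = version_byte[0]
    if version < 5:
        # Older versions lay out the header differently; not covered here.
        raise ValueError("header version %d not handled in this sketch" % version)
    key_class = read_string(stream)
    value_class = read_string(stream)
    flags = stream.read(2)
    if len(flags) != 2:
        raise ValueError("unexpected end of header")
    compressed = flags[0] != 0
    block_compressed = flags[1] != 0
    # The codec class name is only present when the compressed flag is set.
    codec = read_string(stream) if compressed else None
    return {
        "version": version,
        "key_class": key_class,
        "value_class": value_class,
        "compressed": compressed,
        "block_compressed": block_compressed,
        "codec": codec,
    }


# Usage (hypothetical local copy of one file from the table location):
# with open("000000_0", "rb") as f:
#     print(read_sequencefile_header(f))
```

If the header reports compressed and block_compressed as true along with the expected codec, the files were written with BLOCK compression regardless of what the compressed field in the describe output says. Note that Hive also has its own hive.exec.compress.output setting, which may be what governs whether query output is compressed.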