To improve Big SQL query performance one should specify the INPUTFORMAT and OUTPUTFORMAT during which of the following?
The Hadoop environment within the IBM InfoSphere BigInsights Version 3.0 can read a large number of storage formats. This flexibility is partially because of the INPUTFORMAT and OUTPUTFORMAT classes that you can specify on the CREATE and ALTER table statements and because of the use of installed and customized SerDe classes. The file formats listed here are available either by using explicit SQL syntax, such as STORED AS PARQUETFILE, or by using installed interfaces, such as Avro.
https://developer.ibm.com/hadoop/2014/09/19/big-sql-3-0-file-formats-usage-performance/
Currently there are no comments in this discussion, be the first to comment!