-
Sets the compression type to use for data blocks, overriding the default.
FileOutputFormatBuilder.OutputOptions.dataBlockSize(long dataBlockSize)
Sets the size for data blocks within each file.
Data blocks are a span of key/value pairs stored in the file that are compressed and indexed
as a group.
FileOutputFormatBuilder.OutputOptions.fileBlockSize(long fileBlockSize)
Sets the size for file blocks in the file system; file blocks are managed, and replicated, by
the underlying file system.
FileOutputFormatBuilder.OutputOptions.indexBlockSize(long indexBlockSize)
Sets the size for index blocks within each file; smaller blocks means a deeper index
hierarchy within the file, while larger blocks mean a more shallow index hierarchy within the
file.
FileOutputFormatBuilder.PathParams.outputPath(org.apache.hadoop.fs.Path path)
Set the Path of the output directory for the map-reduce job.
FileOutputFormatBuilder.OutputOptions.replication(int replication)
Sets the file system replication factor for the resulting file, overriding the file system
default.
Specify a sampler to be used when writing out data.
Specifies a list of summarizer configurations to create summary data in the output file.