Interface RFile.SummaryOptions
- All Known Subinterfaces:
RFile.SummaryFSOptions
- Enclosing class:
- RFile
public static interface RFile.SummaryOptions
This is an intermediate interface in a large builder pattern. Allows setting options for
retrieving summary data.
- Since:
- 2.0.0
-
Method Summary
Modifier and TypeMethodDescriptionendRow(CharSequence endRow) endRow(org.apache.hadoop.io.Text endRow) Summary data may possibly be stored at a more granular level than the entire file.read()Reads summary data from file.selectSummaries(Predicate<SummarizerConfiguration> summarySelector) This method allows retrieving a subset of summary data from a file.startRow(CharSequence startRow) startRow(org.apache.hadoop.io.Text startRow) Summary data may possibly be stored at a more granular level than the entire file.withTableProperties(Iterable<Map.Entry<String, String>> props) Retrieve summaries with provided tables properties.withTableProperties(Map<String, String> props) Any property that impacts file behavior regardless of whether it has theProperty.TABLE_PREFIXmay be accepted and used.
-
Method Details
-
withTableProperties
Retrieve summaries with provided tables properties. Properties for a table can be obtained by callingTableOperations.getProperties(String). Any property that impacts file behavior regardless of whether it has theProperty.TABLE_PREFIXmay be accepted and used. For example, cache and crypto properties could be passed here.- Parameters:
props- iterable over Accumulo table key value properties.- Returns:
- this
-
withTableProperties
Any property that impacts file behavior regardless of whether it has theProperty.TABLE_PREFIXmay be accepted and used. For example, cache and crypto properties could be passed here.- Parameters:
props- a map instead of an Iterable- Returns:
- this
- See Also:
-
selectSummaries
This method allows retrieving a subset of summary data from a file. If a file has lots of separate summaries, reading a subset may be faster.- Parameters:
summarySelector- Only read summary data that was generated with configuration that this predicate matches.- Returns:
- this
-
startRow
Summary data may possibly be stored at a more granular level than the entire file. However there is no guarantee of this. If the data was stored at a more granular level, then this will get a subset of the summary data. The subset will very likely be an inaccurate approximation.- Parameters:
startRow- A non-null start row. The startRow is used exclusively.- Returns:
- this
- See Also:
-
startRow
- Parameters:
startRow- UTF-8 encodes startRow. The startRow is used exclusively.- Returns:
- this
- See Also:
-
endRow
Summary data may possibly be stored at a more granular level than the entire file. However there is no guarantee of this. If the data was stored at a more granular level, then this will get a subset of the summary data. The subset will very likely be an inaccurate approximation.- Parameters:
endRow- A non-null end row. The end row is used inclusively.- Returns:
- this
- See Also:
-
endRow
- Parameters:
endRow- UTF-8 encodes endRow. The end row is used inclusively.- Returns:
- this
- See Also:
-
read
Reads summary data from file.- Returns:
- The summary data in the file that satisfied the selection criteria.
- Throws:
IOException
-