Class WholeColumnFamilyIterator
java.lang.Object
org.apache.accumulo.core.iterators.user.WholeColumnFamilyIterator
- All Implemented Interfaces:
OptionDescriber, SortedKeyValueIterator<Key,Value>, YieldingKeyValueIterator<Key,Value>
public class WholeColumnFamilyIterator
extends Object
implements SortedKeyValueIterator<Key,Value>, OptionDescriber
The WholeColumnFamilyIterator provides row/column-family isolation so that queries see
mutations as atomic. It groups all entries that share a row and column family into a single
key/value pair: the key carries the row and column family, and the value encodes the rest of the
data. That pair is returned to the client as one atomic unit.
To recover the original key/value pairs, call the decodeColumnFamily function on the key/value
pair that this iterator returned.
- Since:
- 1.6.0
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.accumulo.core.iterators.OptionDescriber
OptionDescriber.IteratorOptions
-
Constructor Summary
Constructors
WholeColumnFamilyIterator()
-
Method Summary
Modifier and Type | Method | Description
static final SortedMap<Key,Value> | decodeColumnFamily(Key rowKey, Value rowValue) | Decode whole row/cf out of value.
SortedKeyValueIterator<Key,Value> | deepCopy(IteratorEnvironment env) | Creates a deep copy of this iterator as though seek had not yet been called.
OptionDescriber.IteratorOptions | describeOptions() | Gets an iterator options object that contains information needed to configure this iterator.
static final Value | encodeColumnFamily(List<Key> keys, List<Value> values) | Encode row/cf.
protected boolean | filter(Text currentRow, List<Key> keys, List<Value> values) |
Key | getTopKey() | Returns top key.
Value | getTopValue() | Returns top value.
boolean | hasTop() | Returns true if the iterator has more elements.
void | init(SortedKeyValueIterator<Key, Value> source, Map<String, String> options, IteratorEnvironment env) | Initializes the iterator.
void | next() | Advances to the next K,V pair.
void | seek(Range range, Collection<ByteSequence> columnFamilies, boolean inclusive) | Seeks to the first key in the Range, restricting the resulting K,V pairs to those with the specified columns.
boolean | validateOptions(Map<String, String> options) | Check to see if an options map contains all options required by an iterator and that the option values are in the expected formats.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.accumulo.core.iterators.YieldingKeyValueIterator
enableYielding
-
Constructor Details
-
WholeColumnFamilyIterator
public WholeColumnFamilyIterator()
-
-
Method Details
-
decodeColumnFamily
public static final SortedMap<Key,Value> decodeColumnFamily(Key rowKey, Value rowValue) throws IOException
Decode whole row/cf out of value. Decodes key/value pairs that have been encoded into a single value.
- Parameters:
rowKey - the row key to decode
rowValue - the value to decode
- Returns:
- the sorted map obtained by decoding the flattened data
- Throws:
IOException - Signals that an I/O exception has occurred.
-
encodeColumnFamily
public static final Value encodeColumnFamily(List<Key> keys, List<Value> values) throws IOException
Encode row/cf. Takes a stream of keys and values and outputs a value that encodes everything but their row and column family. Keys and values must be paired one for one.
- Parameters:
keys - the row keys to encode into the value
values - the values to encode
- Returns:
- the value produced by encoding the keys and values
- Throws:
IOException - Signals that an I/O exception has occurred.
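To make the encode/decode round trip concrete, here is a minimal sketch that packs paired qualifiers and values into one byte array and unpacks them into a sorted map. It uses plain strings and an invented length-prefixed layout; `GroupCodecSketch` and its methods are hypothetical, and the byte format is an assumption for illustration, not the format Accumulo actually writes.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical stand-in for the idea behind encodeColumnFamily/decodeColumnFamily.
// The layout (count, then length-prefixed fields) is invented for this sketch.
final class GroupCodecSketch {

  // Packs parallel lists of qualifiers and values into a single byte array.
  static byte[] encode(List<String> qualifiers, List<String> values) {
    if (qualifiers.size() != values.size()) {
      throw new IllegalArgumentException("keys and values must be paired one for one");
    }
    List<byte[]> fields = new ArrayList<>();
    int size = Integer.BYTES; // room for the entry count
    for (int i = 0; i < qualifiers.size(); i++) {
      for (String s : new String[] {qualifiers.get(i), values.get(i)}) {
        byte[] b = s.getBytes(StandardCharsets.UTF_8);
        fields.add(b);
        size += Integer.BYTES + b.length; // length prefix plus payload
      }
    }
    ByteBuffer buf = ByteBuffer.allocate(size);
    buf.putInt(qualifiers.size());
    for (byte[] b : fields) {
      buf.putInt(b.length);
      buf.put(b);
    }
    return buf.array();
  }

  // Unpacks the byte array back into a map sorted by qualifier.
  static SortedMap<String, String> decode(byte[] packed) {
    ByteBuffer buf = ByteBuffer.wrap(packed);
    SortedMap<String, String> entries = new TreeMap<>();
    int count = buf.getInt();
    for (int i = 0; i < count; i++) {
      String qualifier = readField(buf);
      String value = readField(buf);
      entries.put(qualifier, value);
    }
    return entries;
  }

  private static String readField(ByteBuffer buf) {
    byte[] b = new byte[buf.getInt()];
    buf.get(b);
    return new String(b, StandardCharsets.UTF_8);
  }
}
```

In this model the row and column family would travel in the key, which is why only the remaining parts of each entry need to be packed into the value.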
-
filter
- Parameters:
currentRow - All keys and cf have this in their row portion (do not modify!).
keys - One key for each key and cf group in the row, ordered as they are given by the source iterator (do not modify!).
values - One value for each key in keys, ordered to correspond to the ordering in keys (do not modify!).
- Returns:
- true if we want to keep the row, false if we want to skip it
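As a rough illustration of how a subclass might use this hook, the sketch below models the pattern with plain Java collections instead of Accumulo types. `FilterHookSketch`, `keptRows`, and `RequireQualifier` are hypothetical names, and the string-based signature is an assumption made to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical model of the filter hook; plain strings stand in for Accumulo types.
class FilterHookSketch {

  // In this sketch the default is to keep every group.
  protected boolean filter(String currentRow, List<String> keys, List<String> values) {
    return true;
  }

  // Walks the groups in sorted order and keeps those accepted by filter().
  final List<String> keptRows(Map<String, Map<String, String>> groups) {
    Map<String, Map<String, String>> sorted = new TreeMap<>(groups);
    List<String> kept = new ArrayList<>();
    for (Map.Entry<String, Map<String, String>> group : sorted.entrySet()) {
      List<String> keys = new ArrayList<>(group.getValue().keySet());
      List<String> values = new ArrayList<>(group.getValue().values());
      if (filter(group.getKey(), keys, values)) {
        kept.add(group.getKey());
      }
    }
    return kept;
  }
}

// Keeps only the groups that contain a given qualifier.
class RequireQualifier extends FilterHookSketch {
  private final String required;

  RequireQualifier(String required) {
    this.required = required;
  }

  @Override
  protected boolean filter(String currentRow, List<String> keys, List<String> values) {
    return keys.contains(required);
  }
}
```

The override only reads its arguments, in line with the "do not modify" notes on the parameters.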
-
deepCopy
public SortedKeyValueIterator<Key,Value> deepCopy(IteratorEnvironment env)
Description copied from interface: SortedKeyValueIterator
Creates a deep copy of this iterator as though seek had not yet been called. init should be called on an iterator before deepCopy is called. init should not need to be called on the copy that is returned by deepCopy; that is, when necessary init should be called in the deepCopy method on the iterator it returns. The behavior is unspecified if init is called after deepCopy either on the original or the copy. A proper implementation would call deepCopy on the source.
- Specified by:
deepCopy in interface SortedKeyValueIterator<Key,Value>
- Parameters:
env - IteratorEnvironment environment in which iterator is being run, provided by Accumulo itself and is expected to be non-null.
- Returns:
SortedKeyValueIterator a copy of this iterator (with the same source and settings).
-
getTopKey
public Key getTopKey()
Description copied from interface: SortedKeyValueIterator
Returns top key. Can be called 0 or more times without affecting behavior of next() or hasTop(). Note that in minor compaction scope and in non-full major compaction scopes the iterator may see deletion entries. These entries should be preserved by all iterators except ones that are strictly scan-time iterators that will never be configured for the minc or majc scopes. Deletion entries are only removed during full major compactions.
For performance reasons, iterators reserve the right to reuse objects returned by getTopKey when SortedKeyValueIterator.next() is called, changing the data that the object references. Iterators that need to save an object returned by getTopKey ought to copy the object's data into a new object in order to avoid aliasing bugs.
- Specified by:
getTopKey in interface SortedKeyValueIterator<Key,Value>
- Returns:
K
-
getTopValue
public Value getTopValue()
Description copied from interface: SortedKeyValueIterator
Returns top value. Can be called 0 or more times without affecting behavior of next() or hasTop().
For performance reasons, iterators reserve the right to reuse objects returned by getTopValue when SortedKeyValueIterator.next() is called, changing the underlying data that the object references. Iterators that need to save an object returned by getTopValue ought to copy the object's data into a new object in order to avoid aliasing bugs.
- Specified by:
getTopValue in interface SortedKeyValueIterator<Key,Value>
- Returns:
V
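The aliasing hazard described for getTopKey and getTopValue can be reproduced with a small stand-alone model. In the sketch below, `ReusingIterator` is a hypothetical iterator that reuses one mutable `StringBuilder` as its top key; it is not an Accumulo class. Saving the returned reference keeps a view of whatever the iterator holds later, while copying the data at the time of the call preserves it.

```java
import java.util.ArrayList;
import java.util.List;

final class AliasingSketch {

  // Hypothetical iterator that reuses a single mutable object for its top key.
  static final class ReusingIterator {
    private final String[] data;
    private final StringBuilder top = new StringBuilder();
    private int pos = 0;

    ReusingIterator(String... data) {
      this.data = data;
      load();
    }

    boolean hasTop() {
      return pos < data.length;
    }

    StringBuilder getTopKey() {
      return top; // same object on every call
    }

    void next() {
      pos++;
      load();
    }

    private void load() {
      top.setLength(0); // overwrites what earlier callers were handed
      if (hasTop()) {
        top.append(data[pos]);
      }
    }
  }

  // Safe: copies the data out of the reused object before advancing.
  static List<String> collectCopies(ReusingIterator it) {
    List<String> saved = new ArrayList<>();
    while (it.hasTop()) {
      saved.add(it.getTopKey().toString());
      it.next();
    }
    return saved;
  }

  // Buggy: every saved element is the same object, mutated by next().
  static List<StringBuilder> collectAliases(ReusingIterator it) {
    List<StringBuilder> saved = new ArrayList<>();
    while (it.hasTop()) {
      saved.add(it.getTopKey());
      it.next();
    }
    return saved;
  }
}
```

With Accumulo types the same precaution would mean constructing a new key or value from the returned one before calling next().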
-
hasTop
public boolean hasTop()
Description copied from interface: SortedKeyValueIterator
Returns true if the iterator has more elements. Note that if this iterator has yielded (see YieldingKeyValueIterator.enableYielding(YieldCallback)), this method must return false.
- Specified by:
hasTop in interface SortedKeyValueIterator<Key,Value>
- Returns:
true if the iterator has more elements.
-
init
public void init(SortedKeyValueIterator<Key, Value> source, Map<String, String> options, IteratorEnvironment env) throws IOException
Description copied from interface: SortedKeyValueIterator
Initializes the iterator. Data should not be read from the source in this method.
- Specified by:
init in interface SortedKeyValueIterator<Key,Value>
- Parameters:
source - SortedKeyValueIterator source to read data from.
options - Map map of string option names to option values.
env - IteratorEnvironment environment in which iterator is being run, provided by Accumulo itself and is expected to be non-null.
- Throws:
IOException - unused.
-
next
public void next() throws IOException
Description copied from interface: SortedKeyValueIterator
Advances to the next K,V pair. Note that in minor compaction scope and in non-full major compaction scopes the iterator may see deletion entries. These entries should be preserved by all iterators except ones that are strictly scan-time iterators that will never be configured for the minc or majc scopes. Deletion entries are only removed during full major compactions.
- Specified by:
next in interface SortedKeyValueIterator<Key,Value>
- Throws:
IOException - if an I/O error occurs.
-
seek
public void seek(Range range, Collection<ByteSequence> columnFamilies, boolean inclusive) throws IOException
Description copied from interface: SortedKeyValueIterator
Seeks to the first key in the Range, restricting the resulting K,V pairs to those with the specified columns. An iterator does not have to stop at the end of the range. The whole range is provided so that iterators can make optimizations. Seek may be called multiple times with different parameters after SortedKeyValueIterator.init(org.apache.accumulo.core.iterators.SortedKeyValueIterator<K, V>, java.util.Map<java.lang.String, java.lang.String>, org.apache.accumulo.core.iterators.IteratorEnvironment) is called.
Iterators that examine groups of adjacent key/value pairs (e.g. rows) to determine their top key and value should be sure that they properly handle a seek to a key in the middle of such a group (e.g. the middle of a row). Even if the client always seeks to a range containing an entire group (a,c), the tablet server could send back a batch of entries corresponding to (a,b], then reseek the iterator to range (b,c) when the scan is continued.
columnFamilies is used, at the lowest level, to determine which data blocks inside of an RFile need to be opened for this iterator. This set of data blocks is also the set of locality groups defined for the given table. If no columnFamilies are provided, the data blocks for all locality groups inside of the correct RFile will be opened and seeked in an attempt to find the correct start key, regardless of the startKey in the range. In an Accumulo instance in which multiple locality groups exist for a table, it is important to ensure that columnFamilies is properly set to the minimum required column families to ensure that data from separate locality groups is not inadvertently read.
- Specified by:
seek in interface SortedKeyValueIterator<Key,Value>
- Parameters:
range - Range of keys to iterate over.
columnFamilies - Collection of column families to include or exclude.
inclusive - boolean that indicates whether to include (true) or exclude (false) column families.
- Throws:
IOException - if an I/O error occurs.
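The reseek concern for grouping iterators can be sketched with a simplified model. The code below assumes entries are {group, qualifier} pairs sorted by group and then qualifier, and that a continued scan resumes with an exclusive start at the last key the grouping iterator returned, here modelled as (group, ""). `GroupSeekSketch` and both methods are hypothetical and do not reflect how this class implements seek.

```java
import java.util.List;

// Hypothetical model: entries are {group, qualifier} pairs, sorted by group then qualifier.
final class GroupSeekSketch {

  // Naive resume: index of the first entry strictly after (group, qualifier).
  // If the last returned key was (group, ""), this lands on the first entry of
  // the same group, so the group would be emitted a second time.
  static int naiveResume(List<String[]> sorted, String group, String qualifier) {
    int i = 0;
    while (i < sorted.size() && compare(sorted.get(i), group, qualifier) <= 0) {
      i++;
    }
    return i;
  }

  // Group-aware resume: skip every entry whose group is not after the last
  // returned group, so the scan continues with the following group.
  static int groupAwareResume(List<String[]> sorted, String lastReturnedGroup) {
    int i = 0;
    while (i < sorted.size() && sorted.get(i)[0].compareTo(lastReturnedGroup) <= 0) {
      i++;
    }
    return i;
  }

  private static int compare(String[] entry, String group, String qualifier) {
    int byGroup = entry[0].compareTo(group);
    return byGroup != 0 ? byGroup : entry[1].compareTo(qualifier);
  }
}
```

The linear scans keep the sketch short; a real source iterator would be seeked rather than walked from the start.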
-
describeOptions
public OptionDescriber.IteratorOptions describeOptions()
Description copied from interface: OptionDescriber
Gets an iterator options object that contains information needed to configure this iterator. This object will be used by the accumulo shell to prompt the user to input the appropriate information.
- Specified by:
describeOptions in interface OptionDescriber
- Returns:
- an iterator options object
-
validateOptions
public boolean validateOptions(Map<String, String> options)
Description copied from interface: OptionDescriber
Check to see if an options map contains all options required by an iterator and that the option values are in the expected formats.
- Specified by:
validateOptions in interface OptionDescriber
- Parameters:
options - a map of option names to option values
- Returns:
- true if options are valid, false otherwise (IllegalArgumentException preferred)
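The contract above (return true when valid, otherwise return false or, preferably, throw IllegalArgumentException) might look like the following in a generic iterator. This is a sketch only: `OptionValidationSketch` and the `maxEntries` option are hypothetical and are not options defined by WholeColumnFamilyIterator.

```java
import java.util.Map;

final class OptionValidationSketch {

  // "maxEntries" is a made-up option used only to illustrate the contract.
  static boolean validateOptions(Map<String, String> options) {
    String raw = options.get("maxEntries");
    if (raw == null) {
      return false; // required option is missing
    }
    try {
      int parsed = Integer.parseInt(raw);
      if (parsed <= 0) {
        throw new IllegalArgumentException("maxEntries must be positive: " + raw);
      }
    } catch (NumberFormatException e) {
      // Rethrow with a message that names the offending option.
      throw new IllegalArgumentException("maxEntries is not an integer: " + raw, e);
    }
    return true;
  }
}
```

Throwing with a descriptive message gives the caller more to work with than a bare false, which is presumably why the description prefers it.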
-