Class RowColumnSampler

java.lang.Object
org.apache.accumulo.core.client.sample.AbstractHashSampler
org.apache.accumulo.core.client.sample.RowColumnSampler
All Implemented Interfaces:
Sampler

public class RowColumnSampler extends AbstractHashSampler
This sampler can hash any subset of a Key's fields. The fields that hashed for the sample are determined by the configuration options passed in init(SamplerConfiguration). The following key values are valid options.
  • row=true|false
  • family=true|false
  • qualifier=true|false
  • visibility=true|false

If not specified in the options, fields default to false.

To determine what options are valid for hashing see AbstractHashSampler

To configure Accumulo to generate sample data on one thousandth of the column qualifiers, the following SamplerConfiguration could be created and used to configure a table.

 
 new SamplerConfiguration(RowColumnSampler.class.getName()).setOptions(
   ImmutableMap.of("hasher","murmur3_32","modulus","1009","qualifier","true"));
 
 

With this configuration, if a column qualifier is selected then all key values contains that column qualifier will end up in the sample data.

Since:
1.8.0
  • Constructor Details

    • RowColumnSampler

      public RowColumnSampler()
  • Method Details