Class RangePartitioner

java.lang.Object
org.apache.hadoop.mapreduce.Partitioner<org.apache.hadoop.io.Text,org.apache.hadoop.io.Writable>
org.apache.accumulo.hadoop.mapreduce.partition.RangePartitioner
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable

public class RangePartitioner extends org.apache.hadoop.mapreduce.Partitioner<org.apache.hadoop.io.Text,org.apache.hadoop.io.Writable> implements org.apache.hadoop.conf.Configurable
Hadoop partitioner that uses ranges, and optionally sub-bins based on hashing.
Since:
2.0.0
  • Constructor Summary

    Constructors
    Constructor
    Description
     
  • Method Summary

    Modifier and Type
    Method
    Description
    org.apache.hadoop.conf.Configuration
     
    int
    getPartition(org.apache.hadoop.io.Text key, org.apache.hadoop.io.Writable value, int numPartitions)
     
    void
    setConf(org.apache.hadoop.conf.Configuration conf)
     
    static void
    setNumSubBins(org.apache.hadoop.mapreduce.Job job, int num)
    Sets the number of random sub-bins per range
    static void
    setSplitFile(org.apache.hadoop.mapreduce.Job job, String file)
    Sets the hdfs file name to use, containing a newline separated list of Base64 encoded split points that represent ranges for partitioning

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • RangePartitioner

      public RangePartitioner()
  • Method Details

    • getPartition

      public int getPartition(org.apache.hadoop.io.Text key, org.apache.hadoop.io.Writable value, int numPartitions)
      Specified by:
      getPartition in class org.apache.hadoop.mapreduce.Partitioner<org.apache.hadoop.io.Text,org.apache.hadoop.io.Writable>
    • getConf

      public org.apache.hadoop.conf.Configuration getConf()
      Specified by:
      getConf in interface org.apache.hadoop.conf.Configurable
    • setConf

      public void setConf(org.apache.hadoop.conf.Configuration conf)
      Specified by:
      setConf in interface org.apache.hadoop.conf.Configurable
    • setSplitFile

      public static void setSplitFile(org.apache.hadoop.mapreduce.Job job, String file)
      Sets the hdfs file name to use, containing a newline separated list of Base64 encoded split points that represent ranges for partitioning
    • setNumSubBins

      public static void setNumSubBins(org.apache.hadoop.mapreduce.Job job, int num)
      Sets the number of random sub-bins per range