public class PartitioningMapper extends ObjectMapper implements java.io.Serializable
AbstractPartitionPreservingIncrementalJob and its derived classes.
An implementation of Mapper is used for the
map operation, which produces key and intermediate value pairs from the input.
The input to the mapper is assumed to be partitioned by day.
Each key produced by Mapper is tagged with the time for the partition
that the input came from. This enables the combiner and reducer to preserve the partitions.
| Constructor and Description |
|---|
PartitioningMapper() |
| Modifier and Type | Method and Description |
|---|---|
Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> |
getMapper()
Gets the mapper.
|
PartitionPreservingSchemas |
getSchemas()
Gets the Avro schemas.
|
void |
map(java.lang.Object inputObj,
org.apache.hadoop.mapreduce.MapContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context) |
void |
setContext(org.apache.hadoop.mapreduce.TaskInputOutputContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context) |
void |
setMapper(Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> mapper)
Sets the mapper.
|
void |
setSchemas(PartitionPreservingSchemas schemas)
Sets the Avro schemas.
|
close, getContextpublic void map(java.lang.Object inputObj,
org.apache.hadoop.mapreduce.MapContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context)
throws java.io.IOException,
java.lang.InterruptedException
map in class ObjectMapperjava.io.IOExceptionjava.lang.InterruptedExceptionpublic Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> getMapper()
public void setMapper(Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> mapper)
mapper - the mapperpublic void setSchemas(PartitionPreservingSchemas schemas)
schemas - the schemaspublic PartitionPreservingSchemas getSchemas()
public void setContext(org.apache.hadoop.mapreduce.TaskInputOutputContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context)
setContext in class ObjectProcessor