public class PartitioningMapper extends ObjectMapper implements java.io.Serializable
AbstractPartitionPreservingIncrementalJob
and its derived classes.
An implementation of Mapper
is used for the
map operation, which produces key and intermediate value pairs from the input.
The input to the mapper is assumed to be partitioned by day.
Each key produced by Mapper
is tagged with the time for the partition
that the input came from. This enables the combiner and reducer to preserve the partitions.
Constructor and Description |
---|
PartitioningMapper() |
Modifier and Type | Method and Description |
---|---|
Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> |
getMapper()
Gets the mapper.
|
PartitionPreservingSchemas |
getSchemas()
Gets the Avro schemas.
|
void |
map(java.lang.Object inputObj,
org.apache.hadoop.mapreduce.MapContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context) |
void |
setContext(org.apache.hadoop.mapreduce.TaskInputOutputContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context) |
void |
setMapper(Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> mapper)
Sets the mapper.
|
void |
setSchemas(PartitionPreservingSchemas schemas)
Sets the Avro schemas.
|
close, getContext
public void map(java.lang.Object inputObj, org.apache.hadoop.mapreduce.MapContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context) throws java.io.IOException, java.lang.InterruptedException
map
in class ObjectMapper
java.io.IOException
java.lang.InterruptedException
public Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> getMapper()
public void setMapper(Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> mapper)
mapper
- the mapperpublic void setSchemas(PartitionPreservingSchemas schemas)
schemas
- the schemaspublic PartitionPreservingSchemas getSchemas()
public void setContext(org.apache.hadoop.mapreduce.TaskInputOutputContext<java.lang.Object,java.lang.Object,java.lang.Object,java.lang.Object> context)
setContext
in class ObjectProcessor