|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.apache.hadoop.conf.Configured datafu.hourglass.jobs.AbstractJob datafu.hourglass.jobs.TimeBasedJob datafu.hourglass.jobs.IncrementalJob datafu.hourglass.jobs.AbstractPartitionPreservingIncrementalJob datafu.hourglass.jobs.PartitionPreservingIncrementalJob
public class PartitionPreservingIncrementalJob
A concrete version of AbstractPartitionPreservingIncrementalJob
.
This provides an alternative to extending AbstractPartitionPreservingIncrementalJob
.
Instead of extending this class and implementing the abstract methods, this concrete version
can be used instead. Getters and setters have been provided for the abstract methods.
Nested Class Summary |
---|
Nested classes/interfaces inherited from class datafu.hourglass.jobs.AbstractPartitionPreservingIncrementalJob |
---|
AbstractPartitionPreservingIncrementalJob.Report |
Constructor Summary | |
---|---|
PartitionPreservingIncrementalJob(java.lang.Class cls)
Initializes the job. |
Method Summary | |
---|---|
void |
config(org.apache.hadoop.conf.Configuration conf)
Overridden to provide custom configuration before the job starts. |
Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> |
getCombinerAccumulator()
Gets the accumulator used for the combiner. |
protected org.apache.avro.Schema |
getIntermediateValueSchema()
Gets the Avro schema for the intermediate value. |
protected org.apache.avro.Schema |
getKeySchema()
Gets the Avro schema for the key. |
Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> |
getMapper()
Gets the mapper. |
protected org.apache.avro.Schema |
getOutputValueSchema()
Gets the Avro schema for the output data. |
Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> |
getReducerAccumulator()
Gets the accumulator used for the reducer. |
void |
setCombinerAccumulator(Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> combiner)
Set the accumulator for the combiner |
void |
setIntermediateValueSchema(org.apache.avro.Schema intermediateValueSchema)
Sets the Avro schema for the intermediate value. |
void |
setKeySchema(org.apache.avro.Schema keySchema)
Sets the Avro schema for the key. |
void |
setMapper(Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> mapper)
Set the mapper. |
void |
setOnSetup(Setup setup)
Set callback to provide custom configuration before job begins execution. |
void |
setOutputValueSchema(org.apache.avro.Schema outputValueSchema)
Sets the Avro schema for the output data. |
void |
setReducerAccumulator(Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> reducer)
Set the accumulator for the reducer. |
Methods inherited from class datafu.hourglass.jobs.AbstractPartitionPreservingIncrementalJob |
---|
getCombineProcessor, getMapProcessor, getOutputSchemaName, getOutputSchemaNamespace, getReduceProcessor, getReports, initialize, run |
Methods inherited from class datafu.hourglass.jobs.IncrementalJob |
---|
getMaxIterations, getMaxToProcess, getSchemas, isFailOnMissing, setFailOnMissing, setMaxIterations, setMaxToProcess, setProperties |
Methods inherited from class datafu.hourglass.jobs.TimeBasedJob |
---|
getDaysAgo, getEndDate, getNumDays, getStartDate, setDaysAgo, setEndDate, setNumDays, setStartDate, validate |
Methods inherited from class datafu.hourglass.jobs.AbstractJob |
---|
createRandomTempPath, ensurePath, getCountersParentPath, getFileSystem, getInputPaths, getName, getNumReducers, getOutputPath, getProperties, getRetentionCount, getTempPath, isUseCombiner, randomTempPath, setCountersParentPath, setInputPaths, setName, setNumReducers, setOutputPath, setRetentionCount, setTempPath, setUseCombiner |
Methods inherited from class org.apache.hadoop.conf.Configured |
---|
getConf, setConf |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public PartitionPreservingIncrementalJob(java.lang.Class cls) throws java.io.IOException
cls
- class to base job name on
java.io.IOException
Method Detail |
---|
public Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> getMapper()
AbstractPartitionPreservingIncrementalJob
getMapper
in class AbstractPartitionPreservingIncrementalJob
public Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> getCombinerAccumulator()
AbstractPartitionPreservingIncrementalJob
getCombinerAccumulator
in class AbstractPartitionPreservingIncrementalJob
public Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> getReducerAccumulator()
AbstractPartitionPreservingIncrementalJob
getReducerAccumulator
in class AbstractPartitionPreservingIncrementalJob
protected org.apache.avro.Schema getKeySchema()
IncrementalJob
This is also used as the key for the map output.
getKeySchema
in class IncrementalJob
protected org.apache.avro.Schema getIntermediateValueSchema()
IncrementalJob
This is also used for the value for the map output.
getIntermediateValueSchema
in class IncrementalJob
protected org.apache.avro.Schema getOutputValueSchema()
IncrementalJob
getOutputValueSchema
in class IncrementalJob
public void setMapper(Mapper<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> mapper)
mapper
- public void setCombinerAccumulator(Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> combiner)
combiner
- accumulator for the combinerpublic void setReducerAccumulator(Accumulator<org.apache.avro.generic.GenericRecord,org.apache.avro.generic.GenericRecord> reducer)
reducer
- accumulator for the reducerpublic void setKeySchema(org.apache.avro.Schema keySchema)
This is also used as the key for the map output.
keySchema
- key schemapublic void setIntermediateValueSchema(org.apache.avro.Schema intermediateValueSchema)
This is also used for the value for the map output.
intermediateValueSchema
- intermediate value schemapublic void setOutputValueSchema(org.apache.avro.Schema outputValueSchema)
outputValueSchema
- output value schemapublic void setOnSetup(Setup setup)
setup
- object with callback methodpublic void config(org.apache.hadoop.conf.Configuration conf)
AbstractJob
config
in class AbstractJob
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |