DataFu Hourglass

A framework for incrementally processing data in Hadoop.

See:
          Description

Packages
datafu.hourglass.avro Input and output formats for using Avro in incremental Hadoop jobs.
datafu.hourglass.fs Classes for working with the file system.
datafu.hourglass.jobs Incremental Hadoop jobs and some supporting classes.
datafu.hourglass.mapreduce Implementations of mappers, combiners, and reducers used by incremental jobs.
datafu.hourglass.model Interfaces which define the incremental processing model.
datafu.hourglass.schemas Classes that help manage the Avro schemas used by the jobs.

 

A framework for incrementally processing data in Hadoop.



Matthew Hayes