Package datafu.pig.stats

Statistics UDFs for computing median, quantiles, variance, confidence intervals, etc.

See:
          Description

Class Summary
HyperLogLogPlusPlus A UDF that applies the HyperLogLog++ cardinality estimation algorithm.
MarkovPairs Accepts a bag of tuples, with user supplied ordering, and generates pairs that can be used for a Markov chain analysis.
Median Computes the median for a sorted input bag, using type R-2 estimation.
Quantile Computes quantiles for a sorted input bag, using type R-2 estimation.
QuantileUtil Methods used by Quantile.
StreamingMedian Computes the approximate median for a (not necessarily sorted) input bag, using the Munro-Paterson algorithm.
StreamingQuantile Computes approximate quantiles for a (not necessarily sorted) input bag, using the Munro-Paterson algorithm.
VAR Generates the Variance of a set of Values.
VAR.Final  
VAR.Initial  
VAR.Intermediate  
WilsonBinConf Computes the Wilsonian binomial proportion confidence interval
 

Package datafu.pig.stats Description

Statistics UDFs for computing median, quantiles, variance, confidence intervals, etc.



Matthew Hayes, Sam Shah