Package | Description |
---|---|
datafu.org.apache.pig.piggybank.evaluation | |
datafu.pig.bags |
A collection of general purpose UDFs for operating on bags.
|
datafu.pig.geo |
UDFs for geographic computations.
|
datafu.pig.hash |
UDFs for computing hashes from data.
|
datafu.pig.hash.lsh |
UDFs for Locality Sensitive Hashing.
|
datafu.pig.hash.lsh.cosine |
Implementation of Locality Sensitive Hashing
for Cosine Similarity.
|
datafu.pig.hash.lsh.interfaces |
Interfaces used in the implementation of Locality Sensitive Hashing.
|
datafu.pig.hash.lsh.metric |
UDFs for different distance functions (and some similarity functions)
used with Locality Sensitive Hashing.
|
datafu.pig.hash.lsh.p_stable |
Implementation of Locality Sensitive Hashing for
L1 and L2 metrics.
|
datafu.pig.hash.lsh.util |
Utility functions for locality sensitive hashes
|
datafu.pig.linkanalysis |
UDFs for performing link analysis, such as PageRank.
|
datafu.pig.random |
UDFs dealing with randomness.
|
datafu.pig.sampling |
Sampling UDFs, including weighted sample, reservoir sampling, sampling by key, etc.
|
datafu.pig.sessions |
UDFs for sessionizing data.
|
datafu.pig.sets |
UDFs for set operations such as intersect and union.
|
datafu.pig.stats |
Statistics UDFs for computing median, quantiles, variance, confidence intervals, etc.
|
datafu.pig.stats.entropy | |
datafu.pig.text.opennlp | |
datafu.pig.urls |
UDFs for processing URLs.
|
datafu.pig.util |
Other useful utilities.
|