| Package | Description | 
|---|---|
| datafu.org.apache.pig.piggybank.evaluation | |
| datafu.pig.bags | A collection of general purpose UDFs for operating on bags. | 
| datafu.pig.geo | UDFs for geographic computations. | 
| datafu.pig.hash | UDFs for computing hashes from data. | 
| datafu.pig.hash.lsh | UDFs for Locality Sensitive Hashing. | 
| datafu.pig.hash.lsh.cosine | Implementation of Locality Sensitive Hashing for Cosine Similarity. |
| datafu.pig.hash.lsh.interfaces | Interfaces used in the implementation of Locality Sensitive Hashing. | 
| datafu.pig.hash.lsh.metric | UDFs for different distance functions (and some similarity functions) used with Locality Sensitive Hashing. |
| datafu.pig.hash.lsh.p_stable | Implementation of Locality Sensitive Hashing for L1 and L2 metrics. |
| datafu.pig.hash.lsh.util | Utility functions for locality sensitive hashes. |
| datafu.pig.linkanalysis | UDFs for performing link analysis, such as PageRank. | 
| datafu.pig.random | UDFs dealing with randomness. | 
| datafu.pig.sampling | Sampling UDFs, including weighted sample, reservoir sampling, sampling by key, etc. | 
| datafu.pig.sessions | UDFs for sessionizing data. | 
| datafu.pig.sets | UDFs for set operations such as intersect and union. | 
| datafu.pig.stats | Statistics UDFs for computing median, quantiles, variance, confidence intervals, etc. | 
| datafu.pig.stats.entropy | |
| datafu.pig.text.opennlp | |
| datafu.pig.urls | UDFs for processing URLs. | 
| datafu.pig.util | Other useful utilities. |