public class CountEach
extends org.apache.pig.AccumulatorEvalFunc<org.apache.pig.data.DataBag>
DEFINE CountEach datafu.pig.bags.CountEach();
DEFINE CountEachFlatten datafu.pig.bags.CountEach('flatten');
-- input:
-- ({(A),(A),(C),(B)})
input = LOAD 'input' AS (B: bag {T: tuple(alpha:CHARARRAY, numeric:INT)});
-- output:
-- ({((A),2),((C),1),((B),1)})
output = FOREACH input GENERATE CountEach(B);
-- output_flatten:
-- ({(A,2),(C,1),(B,1)})
output_flatten = FOREACH input GENERATE CountEachFlatten(B);
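The difference between the two output shapes above can be sketched in plain Java, with no Pig dependency. This is only an illustration and not the DataFu implementation: the class `CountEachShapes`, its methods, and the use of strings in place of Pig tuples are all hypothetical, and first-seen ordering is an assumption made so the printed bags match the example.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class CountEachShapes {
    // Count occurrences of each distinct value. LinkedHashMap keeps
    // first-seen order, which is an assumption of this sketch only.
    public static Map<String, Integer> countEach(List<String> bag) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String item : bag) {
            counts.merge(item, 1, Integer::sum);
        }
        return counts;
    }

    // Default shape: each output tuple wraps the original tuple, e.g. ((A),2).
    public static String nested(Map<String, Integer> counts) {
        return counts.entrySet().stream()
                .map(e -> "((" + e.getKey() + ")," + e.getValue() + ")")
                .collect(Collectors.joining(",", "{", "}"));
    }

    // 'flatten' shape: the original tuple's fields are inlined, e.g. (A,2).
    public static String flattened(Map<String, Integer> counts) {
        return counts.entrySet().stream()
                .map(e -> "(" + e.getKey() + "," + e.getValue() + ")")
                .collect(Collectors.joining(",", "{", "}"));
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = countEach(List.of("A", "A", "C", "B"));
        System.out.println(nested(counts));    // {((A),2),((C),1),((B),1)}
        System.out.println(flattened(counts)); // {(A,2),(C,1),(B,1)}
    }
}
```

The only difference between the two shapes is whether the counted tuple stays nested inside the output tuple or has its fields promoted to the top level, which matters when downstream statements refer to fields by position.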
| Constructor and Description |
|---|
| CountEach() |
| CountEach(java.lang.String arg) |
| Modifier and Type | Method and Description |
|---|---|
| void | accumulate(org.apache.pig.data.Tuple input) |
| void | cleanup() |
| org.apache.pig.data.DataBag | getValue() |
| org.apache.pig.impl.logicalLayer.schema.Schema | outputSchema(org.apache.pig.impl.logicalLayer.schema.Schema input) |
Inherited methods: allowCompileTimeCalculation, finish, getArgToFuncMapping, getCacheFiles, getInputSchema, getLogger, getPigLogger, getReporter, getReturnType, getSchemaName, getSchemaType, getShipFiles, isAsynchronous, progress, setInputSchema, setPigLogger, setReporter, setUDFContextSignature, warn

public void accumulate(org.apache.pig.data.Tuple input)
throws java.io.IOException
Specified by: accumulate in interface org.apache.pig.Accumulator<org.apache.pig.data.DataBag>
Specified by: accumulate in class org.apache.pig.AccumulatorEvalFunc<org.apache.pig.data.DataBag>
Throws: java.io.IOException

public org.apache.pig.data.DataBag getValue()
Specified by: getValue in interface org.apache.pig.Accumulator<org.apache.pig.data.DataBag>
Specified by: getValue in class org.apache.pig.AccumulatorEvalFunc<org.apache.pig.data.DataBag>

public void cleanup()
Specified by: cleanup in interface org.apache.pig.Accumulator<org.apache.pig.data.DataBag>
Specified by: cleanup in class org.apache.pig.AccumulatorEvalFunc<org.apache.pig.data.DataBag>

public org.apache.pig.impl.logicalLayer.schema.Schema outputSchema(org.apache.pig.impl.logicalLayer.schema.Schema input)
Overrides: outputSchema in class org.apache.pig.EvalFunc<org.apache.pig.data.DataBag>
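
The accumulate, getValue, and cleanup methods listed here follow the Pig Accumulator contract: accumulate may be called several times with successive batches, getValue returns the result for everything seen so far, and cleanup resets state before the next group. The sketch below mimics that lifecycle in plain Java. It is a rough illustration under stated assumptions, not the DataFu source: `CountEachLifecycle` is a hypothetical class, batches are lists of strings rather than Pig tuples holding bags, and the result is a list of "item=count" strings rather than a DataBag.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class CountEachLifecycle {
    // Running counts; first-seen order is an assumption of this sketch.
    private final Map<String, Integer> counts = new LinkedHashMap<>();

    // Called once per batch. A real accumulator would receive a Pig Tuple
    // whose first field is a bag; a List<String> stands in for it here.
    public void accumulate(List<String> batch) {
        for (String item : batch) {
            counts.merge(item, 1, Integer::sum);
        }
    }

    // Returns the result for everything accumulated so far.
    public List<String> getValue() {
        List<String> out = new ArrayList<>();
        counts.forEach((item, n) -> out.add(item + "=" + n));
        return out;
    }

    // Clears state so the same instance can process the next group.
    public void cleanup() {
        counts.clear();
    }

    public static void main(String[] args) {
        CountEachLifecycle f = new CountEachLifecycle();
        f.accumulate(List.of("A", "A")); // first batch
        f.accumulate(List.of("C", "B")); // second batch
        System.out.println(f.getValue()); // [A=2, C=1, B=1]
        f.cleanup();
        System.out.println(f.getValue()); // []
    }
}
```

Keeping only a map of running counts between calls means memory use grows with the number of distinct items rather than with the total size of the input bag.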