This section contains reference documentation for the PERCENTILEKLL function.
KLL Sketch is an approximate quantiles algorithm that targets optimal space for a given accuracy.
PERCENTILEKLL is a percentile aggregation function based on the Apache DataSketches KLL Doubles Sketch implementation.
Pinot also offers a 'raw' variant, PERCENTILEKLLRAW, which returns the serialized sketch; that sketch can then be used for calculating 'rank' or 'histogram'.
All variants of PercentileKLL also support raw sketches in Pinot columns. This means you can create KLL Doubles sketches outside of Pinot and ingest them into columns as binary strings.
PercentileKLL will identify these columns and merge them to produce aggregate results.
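As a rough illustration of querying pre-built sketches, the query below applies the function to a column that already holds serialized KLL Doubles sketches. The table name deliveryStats and the columns region and latencySketch are hypothetical, chosen only for this sketch; the call shape follows the signature documented on this page.

```sql
-- Hypothetical table and columns: latencySketch is assumed to hold
-- KLL Doubles sketches that were built outside of Pinot and ingested as bytes.
-- Per the description above, the sketches in each group are merged
-- before the percentile is read from the combined sketch.
select region, percentileKLL(latencySketch, 95) as LatencyP95
from deliveryStats
group by region
```

Because the result comes from merged sketches, it is an approximation of the 95th percentile over the underlying values, not an exact figure.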
PercentileKLL(column, percentile, kValue) -> Double
column (required): Name of the column to aggregate on. If the column is a multi-value column, use
percentile (required): Percentile value to be calculated, in the range [0..100]
select percentileKLL(ArrDelayMinutes, 90) as DelayP90
select Carrier, percentileKll(ArrDelay, 50, 600) as MedianDelay
where ArrDelay > 0
group by Carrier
order by 2 desc