This section contains reference documentation for the HISTOGRAM function.
Returns the count of data points that fall within each bin as a vector. The bins are left-inclusive and right-exclusive, i.e. [a, b)
, except for the last one which is inclusive on both sides [a, b]
.
Equal length bins (better performance):
HISTOGRAM(colName, lower, upper, numBins)
Arbitrary increasing bin edges:
HISTOGRAM(colName, ARRAY[binEdge1, binEdge2, binEdge3, ...])
These examples are based on the Batch Quick Start.
10 equal-length bins [0, 20), [20, 30) ... [180, 200]
histogram |
---|
6 bins (- ∞, 1), [1, 10), [10, 50), [50,100), [100,500), [500, 1000]
histogram |
---|
32348,21519,11359,7587,5488,5360,6282,7361,585,0
13520,16506,18375,12403,28591,8494