DISTINCTCOUNTSMARTHLL
This section contains reference documentation for the DISTINCTCOUNTHLL function.
Signature
DISTINCTCOUNTSMARTHLL(colName, log2m)
Usage Examples
These examples are based on the Batch Quick Start.
DISTINCTCOUNTSMARTHLL considerations
DISTINCTCOUNTHLL()
is faster thanDISTINCTCOUNT()
if data is pre-aggregated at ingestion or aggregated at a server with enough records. This performance improvement increases when comparing large datasets.If very few records are pre-aggregated,
DISTINCTCOUNTHLL()
will not be as fast asDISTINCTCOUNT()
because the serialized HLL size is larger than sending individual values.DISTINCTCOUNTHLLPLUS()
provides more precise results thanDISTINCTCOUNTHLL()
with the same performance.DISTINCTCOUNTSMARTHLL()
automatically shifts to HLL when reaching a threshold, and comes with some overhead.
Last updated