Pinot provides metrics out of the box so that you can monitor every aspect of performance and robustness of the Pinot cluster. Most of the metrics are available either at table level or instance level. The metrics can be divided into the following major types when plotting -
Gauge - These represent a single value at any point in time.
Meter - These represent rates of the metric per minute, per 5 minute etc.
Timer - These metrics record durations and can be used to fetch average last 5 minute duration, 75, 99, 999 percentile values, min-max values etc.
This gives a binary value based on whether low-level consumption is healthy (1) or unhealthy (0). It’s important to ensure at least a single replica of each partition is consuming.
The highest offset which has been consumed so far
total number of records in table
total number of segments in table
total unique primary keys in table
time in seconds it took for latest realtime segment to get created
time in seconds it took for segment creation to start (generally due to waiting for a lock to get acquired)