V2 Multi-Stage Query Engine

Overview

The new multi-stage query engine (a.k.a V2 query engine) is designed to support more complex SQL semantics such as JOIN, OVER window, MATCH_RECOGNIZE and eventually, make Pinot support closer to full ANSI SQL semantics.

It also resolves the bottleneck effect for the broker reduce stage where only a single machine is dedicated to perform heavy lifting such as high cardinality GROUP BY result merging; ORDER BY sorting, etc.

How to use the V2 query engine

To enable the V2 engine,

please make sure to either
- Building Apache Pinot using the latest master commit.
- Download the latest Apache Pinot docker image using the official guide.

Please add the following configurations to your cluster config:

"pinot.multistage.engine.enabled": "true",
"pinot.server.instance.currentDataTableVersion": "4",
"pinot.query.server.port": "8421",
"pinot.query.runner.port": "8442"

Start the cluster normally, you should see the following window in the controller query page:
Sample Query Screenshot

Design Details

The overall PEP design doc and discussion can be found in the following links

PreviousNull Value Support NextAdvanced Pinot Setup

Last updated 2 years ago

Was this helpful?