> For the complete documentation index, see [llms.txt](https://docs.pinot.apache.org/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.pinot.apache.org/release-1.0.0/basics/components/cluster/broker.md). # Broker Brokers handle Pinot queries. They accept queries from clients and forward them to the right servers. They collect results back from the servers and consolidate them into a single response, to send back to the client. ![Broker interaction with other components](/files/-M1c97qmI9TI8SSD0-5a) Pinot brokers are modeled as Helix **spectators**. They need to know the location of each segment of a table (and each replica of the segments) and route requests to the appropriate server that hosts the segments of the table being queried. The broker ensures that all the rows of the table are queried exactly once so as to return correct, consistent results for a query. The brokers may optimize to **prune some of the segments** as long as accuracy is not sacrificed. Helix provides the framework by which spectators can learn the location in which each partition of a resource (*i.e.* participant) resides. The brokers use this mechanism to learn the servers that host specific segments of a table. In the case of hybrid tables, the brokers ensure that the overlap between real-time and offline segment data is queried exactly once, by performing **offline and real-time federation**. Let's take this example, we have real-time data for 5 days - March 23 to March 27, and offline data has been pushed until Mar 25, which is 2 days behind real-time. The brokers maintain this time boundary. ![](/files/-M1Y6WPgBfIM-iC7cHq3) Suppose, we get a query to this table : `select sum(metric) from table`. The broker will split the query into 2 queries based on this time boundary – one for offline and one for real-time. This query becomes `select sum(metric) from table_REALTIME where date >= Mar 25`\ and `select sum(metric) from table_OFFLINE where date < Mar 25` \ The broker merges results from both these queries before returning the result to the client. ## Starting a broker Make sure you've [set up Zookeeper](/release-1.0.0/basics/components/cluster.md#setup-a-pinot-cluster). If you're using Docker, make sure to [pull the ](/release-1.0.0/basics/components/cluster.md#setup-a-pinot-cluster)[Pinot Docker image](/release-1.0.0/basics/components/cluster.md#setup-a-pinot-cluster). To start a broker: {% tabs %} {% tab title="Docker Image" %} ``` docker run \ --network=pinot-demo \ --name pinot-broker \ -d ${PINOT_IMAGE} StartBroker \ -zkAddress pinot-zookeeper:2181 ``` {% endtab %} {% tab title="Launcher Script" %} ``` bin/pinot-admin.sh StartBroker \ -zkAddress localhost:2181 \ -clusterName PinotCluster \ -brokerPort 7000 ``` {% endtab %} {% endtabs %} --- # Agent Instructions This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com. ## Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter: ``` GET https://docs.pinot.apache.org/release-1.0.0/basics/components/cluster/broker.md?ask=&goal= ``` `ask` is the immediate question: it should be specific, self-contained, and written in natural language. `goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.