In this guide you'll learn how to download and install Apache Pinot as a standalone instance.
First, let's download the Pinot distribution for this tutorial. You can either build the distribution from source or download a packaged release.
Follow these steps to checkout code from Github and build Pinot locally
# checkout pinotgit clone https://github.com/apache/pinot.gitcd incubator-pinot# build pinotmvn install package -DskipTests -Pbin-dist# navigate to directory containing the setup scriptscd pinot-distribution/target/apache-pinot-incubating-$PINOT_VERSION-bin/apache-pinot-incubating-$PINOT_VERSION-bin
Download the latest binary release from Apache Pinot, or use this command
PINOT_VERSION=0.7.1 #set to the Pinot version you decide to usewget https://downloads.apache.org/pinot/apache-pinot-incubating-$PINOT_VERSION/apache-pinot-incubating-$PINOT_VERSION-bin.tar.gz
Once you have the tar file,
# untar ittar -zxvf apache-pinot-incubating-$PINOT_VERSION-bin.tar.gz# navigate to directory containing the launcher scriptscd apache-pinot-incubating-$PINOT_VERSION-bin
We'll be using the quick-start scripts provided along with pinot distribution, which do the following:
Set up the Pinot cluster
Create a sample table and load sample data
The following quick start scripts are available. Please note though, these scripts launch the Pinot cluster with minimal resources. If you intend to play with sizable data (more than few MB), you may want to follow the Manual cluster setup and provide required resources.
Batch quick start creates the pinot cluster, creates an offline table
baseballStats and pushes sample offline data to the table.
That's it! We've spun up a Pinot cluster. You can continue playing with other types of quick start, or simply head on to Pinot Data Explorer to check out the data in the
Streaming quick start sets up a Kafka cluster and pushes sample data to a Kafka topic. Then, it creates the Pinot cluster and creates a realtime table
meetupRSVP which ingests data from the Kafka topic.
# stop previous quick start cluster, if anybin/quick-start-streaming.sh
We now have a Pinot cluster with a realtime table! You can head over to Pinot Data Explorer to check out the data in the
Hybrid quick start sets up a Kafka cluster and pushes sample data to a Kafka topic. Then, it creates the Pinot cluster and creates a hybrid table
airlineStats . The realtime table ingests data from the Kafka topic. Lastly, sample data is pushed into the offline table.
# stop previous quick start cluster, if anybin/quick-start-hybrid.sh
Let's head over to Pinot Data Explorer to check out the data we pushed to the