Running Pinot in Docker
This quick start guide will show you how to run a Pinot cluster using Docker.
Create an isolated bridge network in docker
docker network create -d bridge pinot-demo
We'll be using our docker image apachepinot/pinot:latest
to run this quick start, which does the following:
Sets up the Pinot cluster
Creates a sample table and loads sample data
There are 3 types of quick start examples.
Batch example
Streaming example
Hybrid example
Batch example
In this example we demonstrate how to do batch processing with Pinot.
Starts Pinot deployment by starting
Apache Zookeeper
Pinot Controller
Pinot Broker
Pinot Server
Creates a demo table
baseballStats
Launches a standalone data ingestion job
Builds one Pinot segment for a given CSV data file for table
baseballStats
Pushes the built segment to the Pinot controller
Issues sample queries to Pinot
docker run \
--network=pinot-demo \
--name pinot-quickstart \
-p 9000:9000 \
-d apachepinot/pinot:latest QuickStart \
-type batch
Once the Docker container is running, you can view the logs by running the following command.
docker logs pinot-quickstart -f
That's it! We've spun up a Pinot cluster.
docker logs pinot-quickstart -f
Your cluster is ready once you see the cluster setup completion messages and sample queries, as demonstrated below.
You can head over to Exploring Pinot to check out the data in the baseballStats
table.
Streaming example
In this example we demonstrate how to do stream processing with Pinot.
Starts Pinot deployment by starting
Apache Kafka
Apache Zookeeper
Pinot Controller
Pinot Broker
Pinot Server
Creates a demo table
meetupRsvp
Launches a
meetup
**streamPublishes data to a Kafka topic
meetupRSVPEvents
to be subscribed to by PinotIssues sample queries to Pinot
# stop previous container, if any, or use different network
docker run \
--network=pinot-demo \
--name pinot-quickstart \
-p 9000:9000 \
-d apachepinot/pinot:latest QuickStart \
-type stream
Once the cluster is up, you can head over to Exploring Pinot to check out the data in the meetupRSVPEvents
table.
Hybrid example
In this example we demonstrate how to do hybrid stream and batch processing with Pinot.
Starts Pinot deployment by starting
Apache Kafka
Apache Zookeeper
Pinot Controller
Pinot Broker
Pinot Server
Creates a demo table
airlineStats
Launches a standalone data ingestion job
Builds Pinot segments under a given directory of Avro files for table
airlineStats
Pushes built segments to Pinot controller
Launches a **stream of flights stats
Publishes data to a Kafka topic
airlineStatsEvents
to be subscribed to by PinotIssues sample queries to Pinot
# stop previous container, if any, or use different network
docker run \
--network=pinot-demo \
--name pinot-quickstart \
-p 9000:9000 \
-d apachepinot/pinot:latest QuickStart \
-type hybrid
Once the cluster is up, you can head over to Exploring Pinot to check out the data in the airlineStats
table.
Last updated
Was this helpful?