Quick Start Examples
This section describes quick start commands that launch all Pinot components in a single process.
Pinot ships with QuickStart commands that launch Pinot components in a single process and import pre-built datasets. These QuickStarts are a good place if you're just getting started with Pinot.
Prerequisites
You will need to have installed Pinot locally or have Docker installed if you want to use the Pinot Docker image.
macOS Monterey Users
By default the Airplay receiver server runs on port 7000, which is also the port used by the Pinot Server in the Quick Start. You may see the following error when running these examples:
If you disable the Airplay receiver server and try again, you shouldn't see this error message anymore.
Batch
This example demonstrates how to do batch processing with Pinot. The command:
Starts Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates the
baseballStats
tableLaunches a standalone data ingestion job that builds one segment for a given CSV data file for the
baseballStats
table and pushes the segment to the Pinot Controller.Issues sample queries to Pinot
Batch JSON
This example demonstrates how to import and query JSON documents in Pinot. The command:
Starts Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates the
githubEvents
tableLaunches a standalone data ingestion job that builds one segment for a given JSON data file for the
githubEvents
table and pushes the segment to the Pinot Controller.Issues sample queries to Pinot
Batch with complex data types
This example demonstrates how to do batch processing in Pinot where the the data items have complex fields that need to be unnested. The command:
Starts Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates the
githubEvents
tableLaunches a standalone data ingestion job that builds one segment for a given JSON data file for the
githubEvents
table and pushes the segment to the Pinot Controller.Issues sample queries to Pinot
Streaming
This example demonstrates how to do stream processing with Pinot. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates
meetupRsvp
tableLaunches a
meetup
streamPublishes data to a Kafka topic
meetupRSVPEvents
that is subscribed to by Pinot.Issues sample queries to Pinot
Streaming JSON
This example demonstrates how to do stream processing with JSON documents in Pinot. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates
meetupRsvp
tableLaunches a
meetup
streamPublishes data to a Kafka topic
meetupRSVPEvents
that is subscribed to by PinotIssues sample queries to Pinot
Streaming with minion cleanup
This example demonstrates how to do stream processing in Pinot with RealtimeToOfflineSegmentsTask and MergeRollupTask minion tasks continuously optimizing segments as data gets ingested. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, Pinot Minion, and Pinot Server.
Creates
githubEvents
tableLaunches a GitHub events stream
Publishes data to a Kafka topic
githubEvents
that is subscribed to by Pinot.Issues sample queries to Pinot
Streaming with complex data types
This example demonstrates how to do stream processing in Pinot where the stream contains items that have complex fields that need to be unnested. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, Pinot Minion, and Pinot Server.
Creates
meetupRsvp
tableLaunches a
meetup
streamPublishes data to a Kafka topic
meetupRSVPEvents
that is subscribed to by Pinot.Issues sample queries to Pinot
Upsert
This example demonstrates how to do stream processing with upsert with Pinot. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates
meetupRsvp
tableLaunches a
meetup
streamPublishes data to a Kafka topic
meetupRSVPEvents
that is subscribed to by PinotIssues sample queries to Pinot
Upsert JSON
This example demonstrates how to do stream processing with upsert with JSON documents in Pinot. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates
meetupRsvp
tableLaunches a
meetup
streamPublishes data to a Kafka topic
meetupRSVPEvents
that is subscribed to by PinotIssues sample queries to Pinot
Hybrid
This example demonstrates how to do hybrid stream and batch processing with Pinot. The command:
Starts Apache Kafka, Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server.
Creates
airlineStats
tableLaunches a standalone data ingestion job that builds segments under a given directory of Avro files for the
airlineStats
table and pushes the segments to the Pinot Controller.Launches a stream of flights stats
Publishes data to a Kafka topic
airlineStatsEvents
that is subscribed to by Pinot.Issues sample queries to Pinot
Join
This example demonstrates how to do joins in Pinot using the Lookup UDF. The command:
Starts Apache Zookeeper, Pinot Controller, Pinot Broker, and Pinot Server in the same container.
Creates the
baseballStats
tableLaunches a data ingestion job that builds one segment for a given CSV data file for the
baseballStats
table and pushes the segment to the Pinot Controller.Creates the
dimBaseballTeams
tableLaunches a data ingestion job that builds one segment for a given CSV data file for the
dimBaseballStats
table and pushes the segment to the Pinot Controller.Issues sample queries to Pinot
Last updated