Follow the instructions in Advanced Pinot Setup to set up a Pinot cluster with the following components:
Zookeeper
Controller
Broker
Server
Kafka
Create a Kafka topic
Create a Kafka topic called pullRequestMergedEvents for the demo.
Add Pinot table and schema
The schema is located at examples/stream/githubEvents/pullRequestMergedEvents_schema.json and is also pasted below.
The table config is located at examples/stream/githubEvents/docker/pullRequestMergedEvents_realtime_table_config.json and is also pasted below.
Note
If you're setting this up on a pre-configured cluster, set the properties stream.kafka.zk.broker.url and stream.kafka.broker.list correctly, depending on the configuration of your Kafka cluster.
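As a rough illustration of the note above, the two Kafka properties could be overridden programmatically before the table config is submitted. This is a minimal sketch, not part of the Pinot tooling: the helper name `with_kafka_endpoints` and the host names are hypothetical, and it assumes the stream properties sit under `tableIndexConfig.streamConfigs` in the table config JSON.

```python
import copy
import json


def with_kafka_endpoints(table_config, broker_list, zk_broker_url):
    """Return a copy of a realtime table config with the Kafka endpoints replaced.

    Hypothetical helper for illustration only.
    """
    updated = copy.deepcopy(table_config)
    # Assumption: stream properties live under tableIndexConfig.streamConfigs.
    stream_configs = updated.setdefault("tableIndexConfig", {}).setdefault(
        "streamConfigs", {}
    )
    stream_configs["stream.kafka.broker.list"] = broker_list
    stream_configs["stream.kafka.zk.broker.url"] = zk_broker_url
    return updated


# In-memory stand-in for the table config file; a real config has many more fields.
example_config = {
    "tableName": "pullRequestMergedEvents",
    "tableIndexConfig": {
        "streamConfigs": {
            "stream.kafka.broker.list": "localhost:9092",
            "stream.kafka.zk.broker.url": "localhost:2181/kafka",
        }
    },
}

# Hypothetical endpoints of a pre-configured Kafka cluster.
patched = with_kafka_endpoints(example_config, "kafka-1:9092", "zookeeper-1:2181/kafka")
print(json.dumps(patched["tableIndexConfig"]["streamConfigs"], indent=2))
```

The helper works on a deep copy, so the original config stays unchanged and can be reused for another cluster.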
Add the table and schema using the following command
Publish events
Start streaming GitHub events into the Kafka topic
Create a Kafka topic called pullRequestMergedEvents for the demo.
Add Pinot table and schema
The schema can be found at /examples/stream/githubEvents/ in the release, and is also pasted below.
The table config can be found at /examples/stream/githubEvents/ in the release, and is also pasted below.
Note
If you're setting this up on a pre-configured cluster, set the properties stream.kafka.zk.broker.url and stream.kafka.broker.list correctly, depending on the configuration of your Kafka cluster.
Add the table and schema using the following command
Publish events
Start streaming GitHub events into the Kafka topic
If you already have a Kubernetes cluster with Pinot and Kafka (see Running Pinot in Kubernetes), first create the topic and then set up the table and streaming using
Query
Head over to the Query Console to check out the data!
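Besides the Query Console, the data can also be queried over HTTP. The sketch below posts a SQL statement to the broker's `/query/sql` endpoint using only the Python standard library. The function names are hypothetical, and the broker address used in the usage note is an assumption that depends on how your cluster was started.

```python
import json
import urllib.request


def build_query_request(broker_url, sql):
    """Build a POST request for the Pinot broker's SQL query endpoint."""
    payload = json.dumps({"sql": sql}).encode("utf-8")
    return urllib.request.Request(
        url=broker_url.rstrip("/") + "/query/sql",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(broker_url, sql, timeout=10.0):
    """Send the query to the broker and return the parsed JSON response."""
    request = build_query_request(broker_url, sql)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

For example, assuming a broker reachable at `http://localhost:8099` and a table named after the topic, `run_query("http://localhost:8099", "SELECT COUNT(*) FROM pullRequestMergedEvents")` would return the broker's JSON response for a row count.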
Visualizing on Superset
You can use Superset to visualize this data. Some of the interesting insights we captured were:
Most active organizations during the lockdown
Repositories by number of commits in the Apache organization