Batch Ingestion (Docker)
Step-by-step guide for batch importing data into Pinot running in Docker
Prepare your data
mkdir -p /tmp/pinot-quick-start/rawdatastudentID,firstName,lastName,gender,subject,score,timestampInEpoch
200,Lucy,Smith,Female,Maths,3.8,1570863600000
200,Lucy,Smith,Female,English,3.5,1571036400000
201,Bob,King,Male,Maths,3.2,1571900400000
202,Nick,Young,Male,Physics,3.6,1572418800000Create a schema
Column Type
Description
{
"schemaName": "transcript",
"dimensionFieldSpecs": [
{ "name": "studentID", "dataType": "INT" },
{ "name": "firstName", "dataType": "STRING" },
{ "name": "lastName", "dataType": "STRING" },
{ "name": "gender", "dataType": "STRING" },
{ "name": "subject", "dataType": "STRING" }
],
"metricFieldSpecs": [
{ "name": "score", "dataType": "FLOAT" }
],
"dateTimeFieldSpecs": [{
"name": "timestampInEpoch",
"dataType": "LONG",
"format": "1:MILLISECONDS:EPOCH",
"granularity": "1:MILLISECONDS"
}]
}Create a table configuration
Upload the schema and table configuration
Create and push a segment
Query your data
Last updated
Was this helpful?

