Open snleee opened 3 years ago
where can we download those data from? Maybe move them to another directory and download them from the github URL?
@xiangfu0
I was thinking of the similar approach:
curl
curl https://raw.githubusercontent.com/apache/pinot/master/pinot-tools/src/main/resources/examples/batch/airlineStats/rawdata/2014/01/01/airlineStats_data_2014-01-01.avro
git clone
and then copy the example files, or git probably supports to checkout the specific directory from the repository.
Currently, our binary distribution packs ~100MB raw data for pinot quick starter scripts. Removing this can greatly reduce the size of our official binary distribution, which is currently over 500MB.
For the above files, we should change the script to download instead of packing along with the release.