Most csv files are simply manipulated in __init__. Changing to parquet might increase speed and reduce the size of the repository but will decrease readability (the datasets are examined in the README though). Changing to tsv will enhance human readability.
Most
csv
files are simply manipulated in__init__
. Changing toparquet
might increase speed and reduce the size of the repository but will decrease readability (the datasets are examined in the README though). Changing totsv
will enhance human readability.