Open severo opened 4 months ago
Thanks for the support @severo.
So, my suggestion is as follows (images or audios or any thing with extensions):
So, I can see the total number of images and the number of each extension for the datasets.
We now have the count for every extension in dataset-filetypes
. It's not published in the API though.
Great @severo
Sorry, I am not familiar with this term dataset-filetypes
. What is dataset-filetypes
? Where can I see the feature now, please ?
So, when it published on the API, it will be shown on the Huggingface Datasets ?
dataset-filetypes
is a new "step," i.e., a pre-processed computation. It computes the number of files for each file extension in the main branch.
However, not all "steps" are published in the HTTP API. I haven't created an API endpoint to consume the result yet.
Proposal here