issues
search
huggingface
/
dataset-viewer
Lightweight web API for visualizing and exploring any dataset - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
https://huggingface.co/docs/datasets-server
Apache License 2.0
639
stars
65
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix ISO 639-1 mapping for stemming
#2963
AndreaFrancis
opened
9 minutes ago
0
Removing has_fts field from split-duckdb-index
#2962
AndreaFrancis
opened
3 days ago
0
update test_plan_job_creation_and_termination
#2961
lhoestq
closed
3 days ago
1
Detect rename in smart update
#2960
lhoestq
closed
3 days ago
2
add diagram to docs
#2959
severo
closed
3 days ago
0
Remove blocked only job types
#2958
severo
closed
4 days ago
1
Remove logic for `WORKER_JOB_TYPES_BLOCKED` and `WORKER_JOB_TYPES_ONLY`
#2957
severo
closed
4 days ago
0
Elaborate a diagram that describes the queues/prioritization logic
#2956
severo
closed
3 days ago
2
prioritize jobs from trendy/important datasets
#2955
severo
opened
4 days ago
0
Smart update on all datasets
#2954
lhoestq
closed
4 days ago
0
add /admin/blocked-datasets endpoint
#2953
severo
closed
4 days ago
0
No cache in smart update
#2952
lhoestq
closed
4 days ago
0
Add estimated_num_rows in openapi
#2951
lhoestq
opened
5 days ago
1
add missing migration for estimated_num_rows
#2950
lhoestq
closed
5 days ago
0
Ignore blocked datasets in WorkerSize metrics for auto scaling
#2949
AndreaFrancis
closed
5 days ago
1
Exclude blocked datasets from Job metrics
#2948
AndreaFrancis
closed
6 days ago
1
update indexes
#2947
severo
closed
6 days ago
0
add cudf to toctree
#2946
raybellwaves
closed
6 days ago
1
Add "blocked/not blocked" in job count metrics
#2945
severo
opened
6 days ago
4
Remove old get_df code
#2944
AndreaFrancis
closed
1 week ago
0
WIP: Support `Sequence()` features in Croissant crumbs.
#2943
marcenacp
opened
1 week ago
0
Increase blockage duration
#2942
severo
closed
1 week ago
0
add cudf example
#2941
raybellwaves
closed
6 days ago
2
Add num_rows estimate in hub_cache
#2940
lhoestq
closed
5 days ago
0
fix flaky test gen_kwargs
#2939
lhoestq
closed
1 week ago
0
Do not keep DataFrames in memory in orchestrator classes
#2938
albertvillanova
opened
1 week ago
0
Enable estimate info (size) on all datasets
#2937
lhoestq
closed
1 week ago
2
Update urllib3 to 1.26.19 and 2.2.2 to fix vulnerability
#2936
albertvillanova
closed
6 days ago
2
divide the rate-limit budget by 5
#2935
severo
closed
1 week ago
0
Update scikit-learn to 1.5.0 to fix vulnerability
#2934
albertvillanova
closed
1 week ago
0
create datasetBlockages collection + block datasets
#2933
severo
closed
1 week ago
3
Fix estimate info for zip datasets
#2932
lhoestq
closed
1 week ago
0
Create pastJobs collection
#2931
severo
closed
1 week ago
1
[refactoring] split queue.py in 3 modules
#2930
severo
closed
1 week ago
0
Use current priority for children jobs
#2929
severo
closed
1 week ago
1
FTS: Add specific stemmer for monolingual datasets
#2928
AndreaFrancis
closed
5 days ago
1
Separate expected errors from unexpected ones in Grafana
#2927
severo
opened
1 week ago
0
only raise error in config-is-valid if format is bad
#2926
severo
closed
1 week ago
1
Reorder and hide columns within dataset viewer
#2925
davidberenstein1957
opened
1 week ago
1
Delete canonical datasets
#2924
severo
opened
1 week ago
0
Fix estimate info allow_list
#2923
lhoestq
closed
1 week ago
0
admin-ui: Do not mark gated datasets as error
#2922
AndreaFrancis
closed
1 week ago
0
Do not keep DataFrames in memory in State classes
#2921
albertvillanova
closed
1 week ago
0
order the steps alphabetically
#2920
severo
closed
1 week ago
0
Do not propagate error for is valid and hub cache
#2919
severo
closed
1 week ago
0
The "dataset-hub-cache" and "dataset-is-valid" steps should always return a value
#2918
severo
closed
1 week ago
1
admin UI: automatically fill the steps list
#2917
severo
closed
1 week ago
1
[modality detection] One image in the repo -> Image modality
#2916
severo
opened
1 week ago
0
Bump urllib3 from 2.0.7 to 2.2.2 in /docs in the pip group across 1 directory
#2915
dependabot[bot]
opened
1 week ago
1
Prevents viewer from being pinged for the datasets on both leaderboard orgs
#2914
clefourrier
closed
2 weeks ago
1
Next