huggingface/dataset-viewer
Lightweight web API for visualizing and exploring any dataset - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
https://huggingface.co/docs/datasets-server
Apache License 2.0
640 stars, 65 forks
issues
Bump certifi from 2023.7.22 to 2024.7.4 in /docs in the pip group across 1 directory
#2973
dependabot[bot]
opened
2 hours ago
1
Decrease worker size metrics when dataset is blocked
#2972
AndreaFrancis
opened
10 hours ago
0
Fix dataset name when decreasing metrics
#2971
AndreaFrancis
closed
1 day ago
0
[Modalities] Account for image URLs dataset for Image modality
#2970
lhoestq
opened
1 day ago
1
Add threshold to modalities from filetypes
#2969
lhoestq
closed
1 day ago
0
WIP: Try to get languages from librarian bot PR for FTS
#2968
AndreaFrancis
closed
14 hours ago
5
Add duration to cached steps
#2967
polinaeterna
opened
2 days ago
0
Use placeholder revision in urls in cached responses
#2966
lhoestq
opened
2 days ago
0
Viewer doesn't show images properly after a smart update
#2965
lhoestq
opened
2 days ago
0
Viewer shows outdated cache after renaming a repo and creating a new one with the old name
#2964
albertvillanova
opened
3 days ago
3
Fix ISO 639-1 mapping for stemming
#2963
AndreaFrancis
closed
4 days ago
0
Removing has_fts field from split-duckdb-index
#2962
AndreaFrancis
closed
4 days ago
0
update test_plan_job_creation_and_termination
#2961
lhoestq
closed
1 week ago
1
Detect rename in smart update
#2960
lhoestq
closed
1 week ago
2
add diagram to docs
#2959
severo
closed
1 week ago
0
Remove blocked only job types
#2958
severo
closed
1 week ago
1
Remove logic for `WORKER_JOB_TYPES_BLOCKED` and `WORKER_JOB_TYPES_ONLY`
#2957
severo
closed
1 week ago
0
Elaborate a diagram that describes the queues/prioritization logic
#2956
severo
closed
1 week ago
2
prioritize jobs from trendy/important datasets
#2955
severo
opened
1 week ago
0
Smart update on all datasets
#2954
lhoestq
closed
1 week ago
0
add /admin/blocked-datasets endpoint
#2953
severo
closed
1 week ago
0
No cache in smart update
#2952
lhoestq
closed
1 week ago
0
Add estimated_num_rows in openapi
#2951
lhoestq
opened
1 week ago
1
add missing migration for estimated_num_rows
#2950
lhoestq
closed
1 week ago
0
Ignore blocked datasets in WorkerSize metrics for auto scaling
#2949
AndreaFrancis
closed
1 week ago
1
Exclude blocked datasets from Job metrics
#2948
AndreaFrancis
closed
1 week ago
1
update indexes
#2947
severo
closed
1 week ago
0
add cudf to toctree
#2946
raybellwaves
closed
1 week ago
1
Add "blocked/not blocked" in job count metrics
#2945
severo
opened
1 week ago
4
Remove old get_df code
#2944
AndreaFrancis
closed
1 week ago
0
WIP: Support `Sequence()` features in Croissant crumbs.
#2943
marcenacp
opened
1 week ago
0
Increase blockage duration
#2942
severo
closed
1 week ago
0
add cudf example
#2941
raybellwaves
closed
1 week ago
2
Add num_rows estimate in hub_cache
#2940
lhoestq
closed
1 week ago
0
fix flaky test gen_kwargs
#2939
lhoestq
closed
2 weeks ago
0
Do not keep DataFrames in memory in orchestrator classes
#2938
albertvillanova
opened
2 weeks ago
0
Enable estimate info (size) on all datasets
#2937
lhoestq
closed
2 weeks ago
2
Update urllib3 to 1.26.19 and 2.2.2 to fix vulnerability
#2936
albertvillanova
closed
1 week ago
2
divide the rate-limit budget by 5
#2935
severo
closed
2 weeks ago
0
Update scikit-learn to 1.5.0 to fix vulnerability
#2934
albertvillanova
closed
2 weeks ago
0
create datasetBlockages collection + block datasets
#2933
severo
closed
2 weeks ago
3
Fix estimate info for zip datasets
#2932
lhoestq
closed
2 weeks ago
0
Create pastJobs collection
#2931
severo
closed
2 weeks ago
1
[refactoring] split queue.py in 3 modules
#2930
severo
closed
2 weeks ago
0
Use current priority for children jobs
#2929
severo
closed
2 weeks ago
1
FTS: Add specific stemmer for monolingual datasets
#2928
AndreaFrancis
closed
1 week ago
1
Separate expected errors from unexpected ones in Grafana
#2927
severo
opened
2 weeks ago
0
only raise error in config-is-valid if format is bad
#2926
severo
closed
2 weeks ago
1
Reorder and hide columns within dataset viewer
#2925
davidberenstein1957
opened
2 weeks ago
1
Delete canonical datasets
#2924
severo
opened
2 weeks ago
0