huggingface / dataset-viewer

Lightweight web API for visualizing and exploring any dataset - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
https://huggingface.co/docs/datasets-server
Apache License 2.0
640 stars 65 forks source link

Apply recommendations from duckdb to improve speed #2882

Open severo opened 1 month ago

severo commented 1 month ago

DuckDB has a dedicated page called "My Workload Is Slow"

https://duckdb.org/docs/guides/performance/my_workload_is_slow

and more generally all the https://duckdb.org/docs/guides/performance/overview section.

It could be good to review if some recommendations apply to our usage of duckdb.

AndreaFrancis commented 1 month ago

I would like to work on this since I analyzed most of the document's points when trying to optimize /search but did not document my findings.