huggingface / dataset-viewer

Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.
https://huggingface.co/docs/dataset-viewer
Apache License 2.0

Image URL detection #3054

Open hlky opened 2 months ago

hlky commented 2 months ago

`is_image_url` relies on a filename and extension being present in the URL. However, in some cases an image URL does not contain a filename. Example dataset and example URL. Detection could be improved by checking the `Content-Type` header of the response, or by checking for strings like "image" in the URL.
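To make the suggestion concrete, here is a rough sketch of how the two fallbacks could be layered on top of an extension check. It is not the existing `is_image_url` implementation: the names `looks_like_image_url`, `head_content_type`, and the `fetch_content_type` parameter are hypothetical, and only the Python standard library is used. The network lookup is injectable so the heuristic can be exercised without real requests.

```python
import mimetypes
from typing import Callable, Optional
from urllib.parse import urlparse
from urllib.request import Request, urlopen


def head_content_type(url: str, timeout: float = 5.0) -> Optional[str]:
    """Hypothetical helper: return the Content-Type from a HEAD request, or None on failure."""
    try:
        with urlopen(Request(url, method="HEAD"), timeout=timeout) as response:
            return response.headers.get("Content-Type")
    except (OSError, ValueError):
        # Covers network errors (URLError is an OSError) and malformed URLs.
        return None


def looks_like_image_url(
    url: str,
    fetch_content_type: Optional[Callable[[str], Optional[str]]] = None,
) -> bool:
    """Hypothetical sketch of image URL detection with fallbacks.

    1. Extension check on the URL path.
    2. Substring check for "image" anywhere in the URL (permissive; may give false positives).
    3. Optional Content-Type lookup through the supplied callable.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https"):
        return False

    # 1. Filename and extension present, e.g. ".../cat.jpg".
    guessed, _ = mimetypes.guess_type(parsed.path)
    if guessed is not None and guessed.startswith("image/"):
        return True

    # 2. No usable extension: look for the word "image" in the URL.
    if "image" in url.lower():
        return True

    # 3. Last resort: ask the server, only if the caller opted in.
    if fetch_content_type is not None:
        content_type = fetch_content_type(url)
        return content_type is not None and content_type.lower().startswith("image/")

    return False
```

Passing `fetch_content_type=head_content_type` enables the network check. Leaving it as `None` keeps the function purely string-based, which may matter if it is called once per cell, since a HEAD request per URL would be slow.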