Closed domini8888 closed 4 months ago
Hi @domini8888 the galleries are created as follows:
fd.vis.duplicates_gallery() # create a visual gallery of duplicates
fd.vis.outliers_gallery() # create a visual gallery of anomalies
fd.vis.component_gallery() # create a visualization of connected components
fd.vis.stats_gallery() # create a visualization of images statistics (e.g. blur)
fd.vis.similarity_gallery() # create a gallery of similar images
In case you are working inside a jupyter notebook you will see a gallery view, otherwise if you work in a python terminal an html file will be created you can view it using any browser. Let us know if this works.
Fixed the printout, it is edges not nodes
What happened?
Hi! So I ran the guide to extract dataset feature vectors with DINOv2 locally on my computer, and it ran succesfully, but its output is strange saying that the largest cluster has 20,318 images, when I only have 11,000ish images. Why so?
What did you expect to see?
Largest cluster having less than total number of images
What version of fastdup were you runnning on?
1.73
What version of Python were you running on?
Python 3.9
Operating System
Ubuntu 20.04
Reproduction steps
No response
Relevant log output
Attach a screenshot [Optional]
No response
Contact Details [Optional]
dominicc@mit.edu