fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
Other
1.59k
stars
77
forks
source link
[Bug]:AssertionError: For removing wrong labels created by the create_similarity_gallery() need to run stats_file=df where df is the output of create_similarity_gallery() #319
if isinstance(stats_file, pd.DataFrame):
assert isinstance(work_dir, str) and os.path.exists(work_dir), "When providing pandas dataframe need to set work_dir to point to fastdup work_dir"
df = stats_file
else:
df = load_stats(stats_file, work_dir, {})
if metric == "score" and metric not in df.columns:
assert False, "For removing wrong labels created by the create_similarity_gallery() need to run stats_file=df where df is the output of create_similarity_gallery()"
What did you expect to see?
I expected all the similar images to be removed.
What version of fastdup were you runnning on?
1.111
What version of Python were you running on?
Python 3.8
Operating System
Ubuntu 20.04
Reproduction steps
Create a similarity gallery and store the output in a dataframe. fastdup.create_similarity_gallery()
Use the dataframe for removing all the similar images. fastdup.delete_or_retag_stats_outliers()
Relevant log output
Traceback (most recent call last):
File "/home/$USER/miniconda3/envs/env_4/lib/python3.8/site-packages/fastdup/__init__.py", line 1800, in delete_or_retag_stats_outliers
assert False, "For removing wrong labels created by the create_similarity_gallery() need to run stats_file=df where df is the output of create_similarity_gallery()"
AssertionError: For removing wrong labels created by the create_similarity_gallery() need to run stats_file=df where df is the output of create_similarity_gallery()
What happened?
Creating similarity Gallery:
Removing the similar items:
Origin of Error:
file
fastdup/__init__.py
What did you expect to see?
I expected all the similar images to be removed.
What version of fastdup were you runnning on?
1.111
What version of Python were you running on?
Python 3.8
Operating System
Ubuntu 20.04
Reproduction steps
fastdup.create_similarity_gallery()
fastdup.delete_or_retag_stats_outliers()
Relevant log output