visual-layer / fastdup

fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
Other
1.52k stars 74 forks source link

[Bug]: fastdup overwrites similarity.csv when calling run_mode=4 or search #252

Open Tompil3r opened 11 months ago

Tompil3r commented 11 months ago

What happened?

When calling run(run_mode=4) or search - which both generate new similarities, fastdup overwrites the similarity.csv in the work_dir. Meaning in order to keep the similarities a copy of the file needs to be made each time.

What did you expect to see?

No response

What version of fastdup were you runnning on?

1.30

What version of Python were you running on?

Python 3.10

Operating System

Ubuntu 20

Reproduction steps

No response

Relevant log output

No response

Attach a screenshot [Optional]

No response

Contact Details [Optional]

No response

dnth commented 10 months ago

I find that this is the line that overwrites the similarity.csv file.

https://github.com/visual-layer/fastdup/blob/bf302fa499137f92973057b2bd75f62d5e00ec59/fastdup/__init__.py#L2110

@dbickson could you please check on the C side?