PaulLereverend / NextcloudDuplicateFinder

Save some space by finding your duplicate files
GNU Affero General Public License v3.0
76 stars 16 forks source link

Howto: Batch deleting found duplicate files #94

Open schorfi opened 1 year ago

schorfi commented 1 year ago

Hello, I am facing a huge amount of duplicates due to some redundant upload of files from users after the files got sorted and archived. The UI of the duplicate finder app is providing only limited progress (just 40 files, one click one action) - esp if facing potential 20k of duplicated files.

So, I was asking myself whether it is possible to delete found duplicates based on a rule (e.g. path prefix to delete preferred on upload directories). This could be in the UI or in the command line. For the latter how could this be done or how can I get the output for all found files which I could process in a batch/shell/python script for automated deletion.

Is there an existing way to do that?

MarkoPaasila commented 1 year ago

Sync to a big enough ssd, then run rmlint.

ashish-khokhar commented 1 year ago

@MarkoPaasila Yes that's one of the options, However, we need this feature in this plugin, so I think your suggestion is out of context.