ChenglongMa / zoplicate

A plugin that does one thing only: Detect and manage duplicate items in Zotero.
https://chenglongma.com/zoplicate/
GNU Affero General Public License v3.0
355 stars, 2 forks

Request to add the function of detecting and deleting duplicate PDFs #47

Open hasibagen opened 4 months ago

hasibagen commented 4 months ago

After items are merged, the PDFs from all of the merged items are kept side by side, so many items end up holding several copies of the same PDF. Could you add a function to detect these duplicate PDFs and delete them?

ChenglongMa commented 4 months ago

Good idea! Thanks, @hasibagen, I will add this function later.

hasibagen commented 4 months ago

Really looking forward to this function!

fredericky123 commented 2 months ago

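Has this feature been completed yet? One part is deleting multiple duplicate PDFs. The other is that I would like to replace the accepted-version PDF with the officially published PDF, and at the same time migrate the notes from the earlier PDF onto the new, officially published PDF. Would that be possible?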

fredericky123 commented 2 months ago

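Alternatively, the PDF priority could be configurable like this: keep the version that has annotations; keep the version that has volume and issue numbers; keep the newest version; keep the oldest version.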
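The priority options suggested above could be sketched as an ordered list of tie-breaking rules. This is only a minimal sketch and not the plugin's implementation: the `PdfCandidate` shape, the rule names, and `pickPdfToKeep` are all hypothetical, and none of the fields correspond to Zotero's real API.

```typescript
// Hypothetical description of one PDF attachment; these fields are
// illustrative assumptions, not Zotero's data model.
interface PdfCandidate {
  name: string;
  annotationCount: number;
  hasVolumeAndIssue: boolean;
  dateAdded: number; // milliseconds since epoch
}

type RuleName = "annotated" | "volumeIssue" | "newest" | "oldest";

// A rule returns a negative number when `a` should be preferred over `b`.
type Rule = (a: PdfCandidate, b: PdfCandidate) => number;

const rules: Record<RuleName, Rule> = {
  annotated: (a, b) =>
    Number(b.annotationCount > 0) - Number(a.annotationCount > 0),
  volumeIssue: (a, b) =>
    Number(b.hasVolumeAndIssue) - Number(a.hasVolumeAndIssue),
  newest: (a, b) => b.dateAdded - a.dateAdded,
  oldest: (a, b) => a.dateAdded - b.dateAdded,
};

// Apply the rules in the user's chosen order; later rules only break ties
// left by earlier ones. Returns the candidate to keep.
function pickPdfToKeep(
  candidates: PdfCandidate[],
  order: RuleName[]
): PdfCandidate {
  if (candidates.length === 0) {
    throw new Error("no candidates to choose from");
  }
  const sorted = candidates.slice().sort((a, b) => {
    for (const ruleName of order) {
      const diff = rules[ruleName](a, b);
      if (diff !== 0) return diff;
    }
    return 0;
  });
  return sorted[0];
}
```

Because "newest" and "oldest" contradict each other, a user would normally put only one of them in `order`; whichever rule comes first wins when two rules disagree.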

ChenglongMa commented 2 months ago

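Sorry, I have been a bit busy lately. I am implementing this feature bit by bit, though; besides the feature itself, I also have to consider compatibility with other plugins. I will release an update as soon as I can. Thank you for your interest and your valuable suggestions!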

fredericky123 commented 2 months ago

Thank you very much, looking forward to it!

ChenglongMa commented 1 month ago

Hi @hasibagen and @fredericky123,

Thank you for your valuable suggestions. I'm implementing this function, but I may need your help.

The built-in merge function in Zotero will remove duplicate PDFs only when they:

  1. have exactly the same content;
  2. have the same content type, e.g., application/pdf;
  3. have the same link mode, e.g., both are imported or linked, like this:

    [Screenshot: Snipaste_2024-08-09_09-43-30]

These criteria are so strict that some duplicate files cannot be recognized.
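To illustrate why these criteria are strict, the three conditions can be sketched as a single grouping key: two attachments count as duplicates only if every part of the key matches. This is a rough sketch under stated assumptions, not Zotero's actual merge code; the `Attachment` shape, the `contentHash` field (standing in for "exactly the same content"), and `findStrictDuplicates` are hypothetical.

```typescript
// Hypothetical attachment shape; field names are assumptions for
// illustration and are not taken from Zotero's API.
interface Attachment {
  id: number;
  contentHash: string; // stand-in for "exactly the same content"
  contentType: string; // e.g. "application/pdf"
  linkMode: string;    // e.g. "imported" or "linked"
}

// Group attachments that match on all three criteria and return only the
// groups that contain more than one attachment.
function findStrictDuplicates(attachments: Attachment[]): Attachment[][] {
  const groups: { [key: string]: Attachment[] } = {};
  for (const att of attachments) {
    const key = JSON.stringify([att.contentHash, att.contentType, att.linkMode]);
    if (!groups[key]) {
      groups[key] = [];
    }
    groups[key].push(att);
  }
  return Object.keys(groups)
    .map((key) => groups[key])
    .filter((group) => group.length > 1);
}
```

Under this sketch, an imported PDF and a linked PDF with identical content fall into different groups, and so do two PDFs of the same article whose bytes differ, which is the kind of case the criteria cannot recognize.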

Here I sincerely ask for more help from you:

  1. @hasibagen Could you give me some examples of exceptions where they are duplicates but do not meet the above criteria?
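  2. @fredericky123 The note migration and priority settings you mentioned are great ideas! But how should I distinguish the "official version" from the "accepted version"? Do you mean the preprint version on arXiv? If it is convenient, could you provide an example?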

Thank you so much for your feedback and support!

Chenglong

fredericky123 commented 1 month ago

These are the accepted version and the in-print version of the same article: Um et al_2021_Acad Manage J_The downside of CFO function-based language incongruity.pdf [Uploading um-et-al-2022-the-downside-of-cfo-function-based-language-incongruity.pdf…]()

ChenglongMa commented 1 month ago

Thank you very much! I will compare the two.

ChenglongMa commented 1 month ago

@fredericky123, sorry, your second link cannot be opened.

fredericky123 commented 1 month ago

um-et-al-2022-the-downside-of-cfo-function-based-language-incongruity.pdf How about this one?

ChenglongMa commented 1 month ago

@fredericky123 Received, thank you very much!