klb2 / bibtex-tools

Collection of tools to work with and modernize BibTeX files.
https://pypi.org/project/bibtextools/
GNU General Public License v3.0
1 stars 0 forks source link

Add removal of duplicate entries based on content; Close #2 #4

Closed klb2 closed 3 hours ago

klb2 commented 3 hours ago

A new clean_bib_file.remove_duplicate_entries function is added. This will now remove duplicate entries based on their content (similar title, authors, ...) immediately after loading the bib files.

By default, this feature is interactive, i.e., it will show the user two identical entries and waits until the user makes a decision which one of them should be deleted. However, there is an additional --force flag that can be set to skip this interactive prompt and delete the entry with less fields.

This should address #2 by not just looking for duplicate IDs, but also duplicate content.