Open koppor opened 1 month ago
I want to take this issue.
/assign @leaf-soba
Hey @,
Thanks for your interest in this issue!
Newcomers, we're excited to have you on board. Start by exploring our Contributing guidelines, and don't forget to check out our workspace setup guidelines to get started smoothly.
In case you encounter failing tests during development, please check our developer FAQs!
Having any questions or issues? Feel free to ask here on GitHub. Need help setting up your local workspace? Join the conversation on JabRef's Gitter chat. And don't hesitate to open a (draft) pull request early on to show the direction it is heading towards. This way, you will receive valuable feedback.
Note that this issue will become unassigned if it isn't closed within days.
A maintainer can also add the **** label to prevent it from being unassigned automatically.
Happy coding!
I want to check: is the next step https://github.com/JabRef/jabref/issues/12139, or is it getting a correct author/abstract?
!! This is more an issue to experiment with heuristics: how can a machine with "traditional" (non-AI) code create useful information? !!
When importing the PDF se2paper.pdf, one gets the following BibTeX entry:
However, the title should be better:
The properties of the file show
Tasks:
- `org.jabref.logic.importer.fileformat.PdfMergeMetadataImporter#importDatabase(java.nio.file.Path)`
- `stripper.setSortByPosition(true);` needs to be removed from `org.jabref.logic.importer.fileformat.PdfContentImporter#getFirstPageContents`. Maybe two methods are needed: one to parse the title (which depends on position) and one to parse the abstract (which depends more on content).

Hint: `org.jabref.logic.importer.fileformat.PdfContentImporterTest` and `org.jabref.logic.importer.fileformat.PdfMergeMetadataImporterTest`
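
To make the "heuristics" idea concrete, here is a minimal sketch of one possible position/layout-based title guess: take the first run of lines set in the largest font on the first page. This is only an illustration, not how JabRef does it. `TitleHeuristicSketch`, `TextLine`, and `guessTitle` are hypothetical names, and the sketch assumes the font size of each line has already been obtained from the PDF somehow.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

// Hypothetical sketch: none of these types exist in JabRef or PDFBox.
final class TitleHeuristicSketch {

    /** One line of first-page text plus the font size it was rendered in (assumed to be known). */
    record TextLine(String text, float fontSize) {}

    /**
     * Guesses the title as the first run of consecutive non-blank lines
     * set in the largest font size on the page. Lines must be in reading order.
     */
    static Optional<String> guessTitle(List<TextLine> firstPageLines) {
        // Pass 1: find the largest font size among non-blank lines.
        float maxSize = 0f;
        for (TextLine line : firstPageLines) {
            if (!line.text().isBlank()) {
                maxSize = Math.max(maxSize, line.fontSize());
            }
        }

        // Pass 2: collect the first run of lines that use that size.
        List<String> titleParts = new ArrayList<>();
        for (TextLine line : firstPageLines) {
            if (line.text().isBlank()) {
                continue; // blank lines neither start nor end the run
            }
            if (Math.abs(line.fontSize() - maxSize) < 0.1f) {
                titleParts.add(line.text().strip());
            } else if (!titleParts.isEmpty()) {
                break; // the run of largest-font lines has ended
            }
        }
        return titleParts.isEmpty()
                ? Optional.empty()
                : Optional.of(String.join(" ", titleParts));
    }

    public static void main(String[] args) {
        // Made-up sample data, not taken from se2paper.pdf.
        List<TextLine> page = List.of(
                new TextLine("Proceedings of Some Workshop", 9f),
                new TextLine("A Two-Line Title", 18f),
                new TextLine("That Wraps Around", 18f),
                new TextLine("Jane Doe", 11f),
                new TextLine("Abstract", 10f));
        System.out.println(guessTitle(page).orElse("(no title found)"));
    }
}
```

A heuristic like this depends on lines arriving in visual order, which is why title parsing and abstract parsing may want separate extraction methods, as suggested in the tasks above.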