popbr / abstract_doi_finder

Apache License 2.0

Abstract & DOI Finder

A simple tool to retrieve the DOIs and abstracts of research articles listed in a spreadsheet.

How-to Use

[!NOTE] If you want to run the program on a different sheet, simply upload your updated spreadsheet and head over to the "Actions" tab. If there is only one spreadsheet in the input/ folder and the workflows have already been enabled, they will run automatically whenever the file in input/ is edited.

Spreadsheet Requirements

[!CAUTION] Be aware that

  1. executing our program as indicated above (that is, on GitHub servers) will make your spreadsheet publicly accessible while the program is running, and afterwards for as long as you keep your fork.
  2. even though our program is not supposed to edit your spreadsheet, backing up your data before running it is always good practice.

Your spreadsheet must

Your spreadsheet can

Your spreadsheet must not

[!WARNING]
The program will not overwrite your original data: it writes its results to a copy of the original sheet. Furthermore, if an article already has data in both the abstract and DOI cells, the program leaves that data untouched. However, if even one of the two is missing, the program will search PubMed for the abstract and DOI, then overwrite the data in its output spreadsheet.
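The rule described in this warning can be sketched in a few lines of Java. This is only an illustration of the decision, not the program's actual code: the class `LookupRuleSketch` and the methods `isBlank` and `needsLookup` are hypothetical names, and treating whitespace-only cells as empty is an assumption.

```java
public class LookupRuleSketch {

    // Hypothetical helper: a cell counts as missing when it is null or
    // contains only whitespace (an assumption, not confirmed by the project).
    static boolean isBlank(String cell) {
        return cell == null || cell.trim().isEmpty();
    }

    // A PubMed search is needed as soon as either the abstract or the DOI
    // is missing; when both are present, the existing data is kept.
    static boolean needsLookup(String abstractCell, String doiCell) {
        return isBlank(abstractCell) || isBlank(doiCell);
    }

    public static void main(String[] args) {
        System.out.println(needsLookup("Some abstract", "10.1000/xyz")); // prints false
        System.out.println(needsLookup("Some abstract", ""));            // prints true
        System.out.println(needsLookup(null, null));                     // prints true
    }
}
```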

An example sheet that can be used as a template is provided.

How-to Compile and Execute

Pre-requisites

Compiling and Running the Program

cd abstract_doi_finder/
mvn compile
mvn exec:java -Dexec.mainClass="popbr.AbstractDoiFinder" -Dexec.args="input_file.xlsx 1,3"

Arguments

In the command line above,

The range can be provided (or omitted) in a multitude of ways including:

[!NOTE]
When entering your sheet range, note that sheet numbers start at 1: the program subtracts 1 from each number you enter so that it runs on the sheets you intended. For example, entering '1,2,3' runs the program on the first, second, and third sheets, which the program addresses internally with the indices 0, 1, and 2. This numbering was chosen for user convenience, so it is important not to use 0 in your range.
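As a rough illustration of the conversion described in this note, the sketch below turns a comma-separated range such as '1,2,3' into zero-based indices. It is not the program's own parser: the class `SheetRangeSketch` and the method `toZeroBasedIndices` are hypothetical names, only the comma-separated form is handled, and rejecting 0 with an exception is an assumption about how invalid input might be treated.

```java
import java.util.ArrayList;
import java.util.List;

public class SheetRangeSketch {

    // Hypothetical helper: converts user-facing sheet numbers (starting at 1)
    // into the zero-based indices used internally.
    static List<Integer> toZeroBasedIndices(String range) {
        List<Integer> indices = new ArrayList<>();
        for (String part : range.split(",")) {
            int sheetNumber = Integer.parseInt(part.trim());
            if (sheetNumber < 1) {
                // Assumption: 0 (or a negative number) is treated as an error here.
                throw new IllegalArgumentException(
                        "Sheet numbers start at 1, got: " + sheetNumber);
            }
            indices.add(sheetNumber - 1);
        }
        return indices;
    }

    public static void main(String[] args) {
        System.out.println(toZeroBasedIndices("1,2,3")); // prints [0, 1, 2]
        System.out.println(toZeroBasedIndices("1,3"));   // prints [0, 2]
    }
}
```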