Added a file containing the database sha256 checksum (no automatic validation yet, though linked to it in the README).
GenerateDatabase.sh
Now is able to use the separate .ttl files instead of requiring a dump-file.
Split downloading/preparing of downloads in 2 separate steps.
Extracting of gzipped data is done parallel.
Each step now checks if the previous step was done if previous step is not executed (note: just a simple directory/file presence check, does not validate whether contents in directory are valid so if an error occurred while creating it, this might still cause issues).
Updates to info.txt file generating.
Shows help message if no arguments are given (instead of running multiple steps by default).
Added TurtleFinder.sh script that allows for easily searching for information in generated .ttl files (not the initial source .ttl files!) when needing to update IT-tests.
Updated checksums.
SPARQL queries adjusted for new DisGeNET version (and some minor other improvements).
Checksum files now contain their used algorithm as file extension.
Changed leading tabs to leading spaces in bash scripts (to be similar with VIP repo's).
Database
* = still also includes certain files from v5.0.0
Changes
README.md
file.LICENSES.md
file.GenerateDatabase.sh
.ttl
files instead of requiring a dump-file.info.txt
file generating.TurtleFinder.sh
script that allows for easily searching for information in generated.ttl
files (not the initial source.ttl
files!) when needing to update IT-tests.