Open ReneRanzinger opened 3 months ago
The sequence data only contains the proteins of the proteomes in glygen. The docker image contains all these sequence files.
I can take a look in the future to separate sequence files from the java tool
As @rykahsay mentioned an option would be a docker volume. Essentially a host folder that gets mounted into the container. For the Java program it still looks like as if it would read the file from its own file system. But it would allow the host to also access and update this files without the container even knowing it.
The isoform2canonical mapper from Preethi is not using any API to retrieve the sequence for isoforms or canonical proteins but rather should come with his own sequence data basis (not database, I actually think its a file or a folder of files):