AstrobioMike / GToTree

A user-friendly workflow for phylogenomics
GNU General Public License v3.0
192 stars 25 forks source link

pfamA.txt.gz missing from ftp? #84

Closed SandersonHaley closed 4 months ago

SandersonHaley commented 4 months ago

Hi, I am trying to get the Pfam from ftp to set up the Bacteria.hmm file for gtotree and got this output:

--2024-02-09 16:29:36-- ftp://ftp.ebi.ac.uk/pub/databases/Pfam/releases/Pfam32.0/database_files/pfamA.txt.gz => ‘pfamA.txt.gz’ Resolving ftp.ebi.ac.uk (ftp.ebi.ac.uk)... 193.62.193.165 Connecting to ftp.ebi.ac.uk (ftp.ebi.ac.uk)|193.62.193.165|:21... connected. Logging in as anonymous ... Logged in! ==> SYST ... done. ==> PWD ... done. ==> TYPE I ... done. ==> CWD (1) /pub/databases/Pfam/releases/Pfam32.0/database_files ... No such directory ‘pub/databases/Pfam/releases/Pfam32.0/database_files’.

Has the file been renamed or moved?

AstrobioMike commented 4 months ago

Hi there :)

Hmm, it does seem the "database_files/" directory is missing from the Pfam32.0 directory currently :/ I don't host or have anything to do with Pfam or Interpro though, so i can't speak to why or where it went.

It does exist for Pfam33.0 though, at https://ftp.ebi.ac.uk/pub/databases/Pfam/releases/Pfam33.0/database_files/pfamA.txt.gz. And in the latest release of Pfam36.0, at https://ftp.ebi.ac.uk/pub/databases/Pfam/releases/Pfam36.0/database_files/pfamA.txt.gz

So you could try doing the process with a new version (being sure to use the same version of Pfam-A.hmm.gz too of course), but i can't say for sure all the code from my initial building from Pfam32.0 years ago would work the same if anything changed in the formatting (assuming that's what you were looking at when you came across this issue).

But also, just to be sure, you don't need to create that, as it (and others) are available through GToTree (you can see them all by running gtt-hmms) – unless of course you're trying to re-build it for yourself for another reason :)

SandersonHaley commented 4 months ago

Thanks, I found the files for Pfam 36.0.