Closed samseaver closed 2 months ago
Hi, I am having similar issue. I was able to parse the files into bed and peptide but getting following error. I would appreciate help on this. Thanks!
Checking Working Directory ... PASS: /Genespace/workingdir
Checking user-defined parameters ...
Genome IDs & ploidy ...
Sitalica_v2.2: 1
Sviridis_v4.1: 1
Taestivum : 1
Zmays : 1
Outgroup ... NONE
n. parallel processes ... 6
collinear block size ... 5
collinear block search radius ... 25
n gaps in collinear block ... 5
synteny buffer size... 100
only orthogroups hits as anchors ... TRUE
n secondary hits ... 0
Checking annotation files (.bed and peptide .fa):
Error: subscript contains out-of-bounds indices
Hello, I heard from Dr Lovell independently of this posted issue, the problem was that the identifiers in the two bed and fasta files didn't match. In my case, I had to make sure I used the "translated_cds" file from the NCBI.
Yeah. This is what happens when you give GENESPACE empty input files. Check the output of parse_annotations
... I bet something went wrong.
Hello again, Thank you for your help with this. I used gene.gff3 and protein.fa file. After parsing the bed file and peptide file GeneID matches. For example, I have attached a bed and peptide file to this comment. However, I am still getting an error saying the following. Could you please let me know how to fix this?
Checking user-defined parameters ...
Genome IDs & ploidy ...
Sitalica : 1
Sviridis : 1
Taestivum: 1
Zmays : 1
Outgroup ... NONE
n. parallel processes ... 6
collinear block size ... 5
collinear block search radius ... 25
n gaps in collinear block ... 5
synteny buffer size... 100
only orthogroups hits as anchors ... TRUE
n secondary hits ... 0
Checking annotation files (.bed and peptide .fa):
Sitalica : 34584 / 34584 geneIDs exactly match (PASS)
Sviridis : 29807 / 29807 geneIDs exactly match (PASS)
Taestivum: 99386 / 99386 geneIDs exactly match (PASS)
Zmays : 39756 / 39756 geneIDs exactly match (PASS)
Checking dependencies ...
Found valid path to OrthoFinder v2.55: orthofinder
Found valid path to DIAMOND2 v2.19: diamond
Found valid MCScanX_h executable: /Users/ ![Screen Shot 2024-03-14 at 9 43 25 AM](https://github.com/jtlovell/GENESPACE/assets/59696208/d1f5485e-261a-4cb9-b427-f3c20ce7d2c8) ![Screen Shot 2024-03-14 at 9 43 46 AM](https://github.com/jtlovell/GENESPACE/assets/59696208/ca792d0f-54fa-4163-8661-c6d37dee6cbc) Downloads/MCScanX-master/MCScanX_h
out <- run_genespace(gpar, overwrite = T)
############################
Try removing the /orthofinder subdirectory (in your genespace working directory) and re-running.
Hi, so I've got a set of Phytozome genomes on which I was able to run parse_annotations() cleanly, and so I get the corresponding bed and peptide fasta files. But when I run init_genespace, though it recognizes all the genomes, it errors out, so here's the output, and the result of the traceback() function. Unfortunately, I'm not familiar with R or the code, so I'm not sure how to follow this:
I'm using version v1.3.1 as installed via devtools: