Closed josephhearnshaw closed 3 years ago
@josephhearnshaw, some paths have changed:
FASTA and GFF3: 2#2 The argument is invalid -- The file /home/data/knetminer/species/fungi/fusarium_2020/fungi/organisms/fusarium_culmorum/ensembl/fusarium_culmorum_pep_all.fa does not exist and is required to do so for Fasta File. Absolute path: /home/data/knetminer/species/fungi/fusarium_2020/fungi/organisms/fusarium_culmorum/ensembl/fusarium_culmorum_pep_all.fa Absolute path to a FASTA input file with protein secuences
FASTA and GFF3: 2#2 The argument is invalid -- The file /home/data/knetminer/species/fungi/fusarium_2020/fungi/organisms/fusarium_culmorum/ensembl/fusarium_culmorum_gene_protein_mapping.txt does not exist and is required to do so for Mapping File. Absolute path: /home/data/knetminer/species/fungi/fusarium_2020/fungi/organisms/fusarium_culmorum/ensembl/fusarium_culmorum_gene_protein_mapping.txt Absolute path to a mapping input file which provides mapping relationsship between the GFF and the FASTA file. It should contain two columns: 2) gene id and 4) protein id
FASTA and GFF3: 2#2 The argument is invalid -- The file /home/data/knetminer/species/fungi/fusarium_2020/fungi/organisms/fusarium_culmorum/ensembl/fusarium_culmorum_genes.gff3 does not exist and is required to do so for GFF3 File. Absolute path: /home/data/knetminer/species/fungi/fusarium_2020/fungi/organisms/fusarium_culmorum/ensembl/fusarium_culmorum_genes.gff3 Absolute path to a GFF3 input file with 9 columns. It uses 1)chromosome id, 4)start, 5)end and 9)gene id and gene description i.e. "ID=PGSC0003DMG400030251;Name=""Conserved gene of unknown function"""
I cannot find those files, please fix their paths (new launching script is basePath
).
With the new launching script and without the FASTA parsing step, it completed without errors, the only strange thing is it took almost an hour to produce a small 1.4Mb OXL. I'm attaching the output here, for future reference.
Out of memory error occurs.
Test workflow and data available under
knetminer/test/mapping_bug/git_issue_30/
The input OMA data might have malformed accession datasources or ids.