Open aleksicmil opened 1 year ago
Followup:
I continued working on this today as I figured there must be something wrong with the gff file. Since I am working with corn, I tried using ftp://ftp.ensemblgenomes.org/pub/plants/release-34/gff3/zea_mays/Zea_mays.AGPv4.34.gff3.gz
from https://github.com/baoxingsong/genomeAlignment/tree/main/alignb73againstmo17. The only change I did to the file was to remove the scaffolds.
So now, I am able to create refCDS.fa that is not empty and the SAM file, but when I run proali using the following command:
anchorwave proali -i Zea_mays.AGPv4.34.noscaffolds.gff3
-r MAIZE_B73.fa
-as refCDS.fa
-a MAIZE_DK105.sam
-ar MAIZE_B73.sam
-s MAIZE_DK105.fa
-n MAIZE_DK105_MAIZE_B73.anchorspro
-R 1 -Q 1 -t 4
-o MAIZE_DK105.maf
I am getting this error:
setupAnchorsWithanchorwave_avx2: /opt/conda/conda-bld/anchorwave_1676953230719/work/src/service/TransferGffWithNucmerResult.cpp:450: void readSam(std::vector<AlignmentMatch>&, std::ifstream&, std::map<std::__cxx11::basic_string<char>, Transcript>&, int&, const double&, double&, std::set<std::__cxx11::basic_string<char> >&, int32_t&, int32_t&, int32_t&, int32_t&, int&, bool&, int&): Assertion `transcriptHashMap[elems[0]].getPStart() == chromosomePosition' failed.
/opt/miniconda/bin/anchorwave: line 22: 184 Aborted (core dumped) "${EXE}_avx2" "$@"
By checking some other issues on this page (Issues #2 or #16), seems to me like the issue is still in the GFF file, which as a result creates SAM file that anchorwave can't process.
Could you please share some guidelines on how I can ensure that my GFF file is ok? Or, if there are any other things that I need to ensure before running proali?
Thank you very much in advance, your help is much appreciated.
Best,
Milica
Hi,
I am trying to run anchorwave as an alignment step of the buckler lab PHG pipeline.
proali step is failing, and seems to me like the problem is that refCDS.fa (and all other files generated in the preparatory steps) are empty. AssemblyMAFFromAnchorWavePlugin from PHG runs the following command:
My GFF file looks like this (just first gene is showed):
Could you please tell me what could cause the issue with an empty output refCDS.fa file? Is there something in the GFF that is not correct? Thanks.
Best,
Milica