faircloth-lab / phyluce

software for UCE (and general) phylogenomics
http://phyluce.readthedocs.org/
Other
80 stars 49 forks source link

phyluce_assembly_assemblo_spades [contig length issue] #344

Open kimchafer opened 4 months ago

kimchafer commented 4 months ago

Hello, I am conducting research using myBaits UCE Coleoptera 1.1Kv1.

I followed the 'Phyluce Tutorial' exactly as described, and I am currently at the "Assemble the data" step, using 'phyluce_assembly_assemblo_spades' for the assembly.

However, I suspect that the length of the contigs produced by the analysis is abnormal. Here is the information:

스크린샷 2024-07-12 오전 12 20 39

Could you advise whether this result is plausible? If not, at which step is it likely that the problem occurred?

brantfaircloth commented 4 months ago

I think that these are values just after assembly? If so, those look pretty plausible. I'd go ahead and extract the UCE contigs from the entirety of contigs you assembled and re-evaluate the stats.

kimchafer commented 2 months ago

I am sorry for the late reply.

Following your recommendation, I continued the analysis and was able to recover more than 600 contigs during the 'Finding UCE loci' stage.

스크린샷 2024-09-04 오후 1 00 23

However, there is an oddity: the ratio of target loci contigs recovered to the total number of assembled contigs is very low (the highest is only 0.45%, and the lowest is 0.13%). Even the highest figure is significantly lower compared to the examples in the tutorial. Could this be normal, or might there have been an issue with the enrichment of target loci during the probe hybridization stage?

I would appreciate your opinion based on your experience. Thank you.

brantfaircloth commented 2 months ago

It's definitely possible - that ratio of UCE contigs to total contigs sounds extremely low. To me, it would suggest an enrichment issue.

kimchafer commented 2 months ago

Ok then this problem may stem from the molecular work step. I will take extra care during the next experiment especially in probe hybridization. I plan to sequence another test dataset soon and will share the results with you once that is completed at this issue.