gmbecker / genbankr

http://bioconductor.org/packages/devel/bioc/html/genbankr.html
14 stars 9 forks source link

Running into C stack issues with readGenBank function #16

Open Lneves23 opened 2 years ago

Lneves23 commented 2 years ago

Hello! Love the GenBank parser, but I am struggling to read in larger files.

In my hands, readGenBank works perfectly with smaller GenBank files (<500 KB), but gives me a C stack usage error with anything larger.

In the example below chrI is 442Kb and chrIII is 638Kb.

chrI <- readGenBank("/Users/lauren.neves/Desktop/chrI.gb") Translation product seems to be missing for 1 of 101 CDS annotations. Setting to '' No exons read from genbank file. Assuming sections of CDS are full exons No transcript features (mRNA) found, using spans of CDSs

chrIII <- readGenBank("/Users/lauren.neves/Desktop/chrIII.gb") Error: C stack usage 7970208 is too close to the limit

I've tried other files as well - larger files always throw this error. Small files work just fine. I am currently running genbankr version 1.20.0 on R version 4.1.2.

Am I doing something wrong and/or do the larger files need some pre-treatment?

Thanks!