Open dsammour opened 4 years ago
Did you ever face such issues? What could you recommend in such situation? No. But I never had so much data.
Sparse matrices could be a solution especially with the on-disk vector feature.
@dsammour and @sgibb, Sorry for putting my inappropriate question here.
I also want to analyze my MALDI-FTICR data (MALDI profiling data, not MALDI imaging data ) with MALDIquant
. But I don't know how to convert them into the data types that are supported by MALDIquantForeign
. How did you convert your data? Thanks.
Dong
Hi @YonghuiDong, could you please open an issue in MALDIquantForeign
and provide more details about the data structure, perhaps with an example. Thanks.
Hi @dsammour , Thanks for your suggestion. I have opened an issue in MALDIquantForeign. Could you please have a look.
https://github.com/sgibb/MALDIquantForeign/issues/31
Thanks a lot.
Dong
Please have a look at PR #71 I've optimized the speed and memory usage for filterPeaks
. The bottleneck in that function is the binary matrix for peaks occurrence.
Thanks to @paoloinglese this is solved for filterPeaks
in #71 and #72, respectively (and just merged into master, not on CRAN yet).
Hi Sebastian,
I am currently working with massive
MassPeaks
lists of MALDI-FTICR data. By massive I meanEverything works flawlessly, but I noticed i) a huge memory usage (can reach up to 120 GBs!) when calling
mergeMassPeaks
and ii) huge memory usage + sometimes error messages when callingfilterMassPeaks
. Both were called afterbinPeaks
. The error message is as follows:I know that internally both functions construct intensity matrices which blows up memory usage. Did you ever face such issues? What could you recommend in such situation?
Suggestion For the internal construction of the intensity matrices, do you think it would be a better idea to construct spars matrices? for examples instead of the current implementation of
.as.matrix.MassObjectList
to use something like this :what do you think?