bittremieux / falcon

Large-scale tandem mass spectrum clustering using fast nearest neighbor searching.
BSD 3-Clause "New" or "Revised" License
24 stars 7 forks source link

Fast I/O with lance #25

Open Janne98 opened 4 months ago

Janne98 commented 4 months ago

1. Improved I/O Write spectra to a single lance file instead of many pickle files. Fast spectrum retrieval with index.

2. Updated binning strategy Removed assigning spectra to mass intervals, instead, cluster spectra per charge.

3. Improved NN searching Create 1 big index per charge. Search for NN using a mask based on precursor m/z