huizhanglab-jhu / ms-pycloud

Apache License 2.0
4 stars 0 forks source link

MS-PyCloud

MS-PyCloud is a comprehensive software for mass spectrometry-based proteomics data analysis, encompassing peptide identification, protein inference, and quantitation of proteins, phosphosites, and glycan-specific glycopeptides. It processes raw Thermo LC-MS/MS data converted to mzML format, performing searches using GPQuest for glycan databases and MS-GF+ for protein databases, with results filtered based on PSM-level false discovery rates. Protein inference utilizes a bipartite graph analysis algorithm to group significant PSMs, assigning shared peptides to the most supported proteins. Quantitation supports various isobaric tags, including iTRAQ4, iTRAQ8, TMT10, TMT11, TMT16, and TMT18, and employs median normalization for accurate abundance, intensity, and Log2 ratios calculations. Enhancements like SQLite databases for peptide fragment indexing, Numba for JIT compilation, and a Streamlit GUI improve performance and usability. Integrated with AWS for scalable cloud computing, MS-PyCloud ensures efficient and high-availability peptide analysis.

Preprint and Software

For those interested in our research and preliminary findings, the preprint manuscript is available for review. You can find the document at the following link:

Software Availability

Latest release: A release version of MS-PyCloud with electron UI is ready in the releases section. 06/12/2024

Download

Contact Us

For any further inquiries or feedback, please don't hesitate to contact us via email at yhu39@jhmi.edu. Your input is invaluable to us as we strive to improve ms-pycloud and its supporting documentation.

References

Software Tools

Algorithm