InstaNovo-P: A de novo peptide sequencing model for phosphoproteomics

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

InstaNovo-P: A de novo peptide sequencing model for phosphoproteomics

Authors

Lauridsen, J.; Ramasamy, P.; Catzel, R.; Canbay, V.; Mabona, A.; Eloff, K.; Fullwood, P.; Ferguson, J.; Kirketerp-Moller, A.; Goldschmidt, I. S.; Claeys, T.; van Puyenbroeck, S.; Lopez Carranza, N.; Schoof, E. M.; Martens, L.; Van Goey, J.; Francavilla, C.; Jenkins, T. P.; Kalogeropoulos, K.

Abstract

Phosphorylation, a crucial post-translational modification (PTM), plays a central role in cellular signaling and disease mechanisms. Mass spectrometry-based phosphoproteomics is widely used for system-wide characterization of phosphorylation events. However, traditional methods struggle with accurate phosphorylated site localization, complex search spaces, and detecting sequences outside the reference database. Advances in de novo peptide sequencing offer opportunities to address these limitations, but have yet to become integrated and adapted for phosphoproteomics datasets. Here, we present InstaNovo-P, a phosphorylation specific version of our transformer-based InstaNovo model, fine-tuned on extensive phosphoproteomics datasets. InstaNovo-P significantly surpasses existing methods in phosphorylated peptide detection and phosphorylated site localization accuracy across multiple datasets, including complex experimental scenarios. Our model robustly identifies peptides with single and multiple phosphorylated sites, effectively localizing phosphorylation events on serine, threonine, and tyrosine residues. We experimentally validate our model predictions by studying FGFR2 signaling, further demonstrating that InstaNovo-P uncovers phosphorylated sites previously missed by traditional database searches. These predictions align with critical biological processes, confirming the model\'s capacity to yield valuable biological insights. InstaNovo-P adds value to phosphoproteomics experiments by effectively identifying biologically relevant phosphorylation events without prior information, providing a powerful analytical tool for the dissection of signaling pathways.

Follow Us on

0 comments

Add comment