AI Pipeline Finds Hidden Cancer Targets That Trigger Real T Cell Attacks
A new machine learning tool called MaNeo scans tumor peptides to identify neoantigens that activate T cells against cancer while sparing healthy cells.
Summary
Researchers built a massive atlas of cancer-presented peptides from 531 tumor and normal tissue samples, then developed an AI screening tool called MaNeo to identify neoantigens — mutant protein fragments displayed on cancer cell surfaces that flag tumors for immune destruction. The atlas covers 14 cancer types and 29 normal tissues, cataloguing over 459,000 peptides including previously overlooked 'noncanonical' sources. MaNeo combines immunopeptidomics data with machine learning to rank candidate neoantigens by their likelihood of triggering T cell responses. Three top candidates were validated in cancer cell lines, successfully activating T cell proliferation and tumor killing without harming healthy cells — a key step toward personalized cancer vaccines.
Detailed Summary
Cancer immunotherapy depends on identifying neoantigens — peptide fragments derived from tumor-specific mutations that are displayed on the surface of cancer cells by MHC (major histocompatibility complex) molecules, where they can be recognized and attacked by T cells. Finding the right neoantigens is technically difficult because most computational predictions rely on genomic data alone and generate high false-positive rates. This study addresses that bottleneck by grounding neoantigen discovery in direct mass spectrometry evidence of what peptides tumors actually present.
The team assembled a pan-cancer immunopeptidomics atlas from 531 samples spanning 14 cancer types and 29 normal tissue types. Mass spectrometry identified 389,165 canonical peptides (derived from standard protein-coding sequences) and 70,270 noncanonical peptides (derived from non-coding regions, alternative reading frames, fusion genes, and other atypical sources). A critical finding was that noncanonical peptides were presented at comparable abundance levels to canonical peptides across cancer types — challenging the common assumption that noncanonical sources are negligible and suggesting they represent a largely untapped pool of immunotherapy targets.
Tumor-specific peptides showed statistically significant differences in biochemical features compared to those found in normal tissues, including differences in hydrophobicity, charge, and MHC binding motifs. These distinctions formed the biological rationale for the MaNeo (machine learning-based neoantigen) pipeline. MaNeo integrates multiple layers of evidence: somatic mutation data, MHC-peptide binding predictions, immunopeptidomics-derived presentation likelihood, and T cell recognition scores. Benchmark analyses demonstrated that MaNeo outperformed existing neoantigen prioritization tools in accurately identifying both shared (public) and tumor-specific (private) neoantigens from canonical and noncanonical sources, achieving superior area under the curve metrics compared to state-of-the-art alternatives.
For experimental validation, the authors applied MaNeo to cancer cell lines and identified three high-priority neo-peptides. These candidates were synthesized and tested in co-culture experiments with human T cells. All three neo-peptides induced significantly increased T cell proliferation and cytotoxic T lymphocyte (CTL) activity, with T cells effectively killing tumor cells expressing the cognate neoantigens. Crucially, the activated T cells did not damage normal cells displaying wild-type peptides, demonstrating tumor selectivity — the defining requirement for a viable immunotherapy target.
The pan-cancer peptide atlas and the MaNeo pipeline together represent a substantial methodological advance for personalized cancer vaccine development. By anchoring predictions in real immunopeptidomics data rather than purely computational binding affinity scores, MaNeo reduces false positives and surfaces noncanonical neoantigens that would otherwise be invisible. Limitations include the fact that validation was conducted in cell lines rather than animal models or patients, and the atlas's coverage, while broad, does not yet capture the full heterogeneity of tumor mutational landscapes across individuals and cancer subtypes.
Key Findings
- Pan-cancer peptide atlas built from 531 samples across 14 cancer and 29 normal tissue types identified 389,165 canonical and 70,270 noncanonical MHC-presented peptides
- Noncanonical peptides (from non-coding regions, alternative reading frames, and fusion genes) were presented at comparable levels to canonical peptides across all cancer types studied
- Tumor-specific peptides showed statistically significant differences in hydrophobicity, charge, and MHC binding motifs compared to normal tissue peptides
- MaNeo pipeline achieved superior benchmark performance over existing neoantigen prioritization tools in identifying both shared and tumor-specific canonical and noncanonical neo-peptides
- Three MaNeo-prioritized neo-peptides validated in cancer cell lines successfully induced significant T cell proliferation and cytotoxic T lymphocyte responses
- Activated T cells killed tumor cells expressing the target neoantigens but did not damage healthy cells displaying wild-type peptides, confirming tumor selectivity
Methodology
The study integrated publicly available and newly generated immunopeptidomics mass spectrometry data from 531 human samples across 14 cancer types and 29 normal tissue types to construct a pan-cancer peptide atlas. The MaNeo pipeline incorporates somatic mutation calling, MHC-peptide binding prediction, immunopeptidomics presentation scoring, and T cell recognition models into a unified machine learning framework. Benchmark comparisons were conducted against established neoantigen prediction tools using area under the curve (AUC) metrics on curated datasets of known immunogenic peptides. Experimental validation used cancer cell line co-culture assays measuring T cell proliferation by flow cytometry and cytotoxic killing assays comparing tumor versus normal cell death.
Study Limitations
The experimental validation of the three neo-peptides was performed exclusively in cancer cell lines rather than in animal tumor models or human clinical samples, limiting confidence on in vivo efficacy and immune context. The pan-cancer atlas, while covering 531 samples across 14 cancer types, does not capture the full intratumoral heterogeneity or rare cancer subtypes that may require separate modeling. As with all immunopeptidomics-driven approaches, mass spectrometry sensitivity limits detection of low-abundance peptides, so some clinically relevant neoantigens may be missed.
Enjoyed this summary?
Get the latest longevity research delivered to your inbox every week.
