A roadmap for T cell receptor-peptide-bound major histocompatibility complex binding prediction by machine learning: glimpse and foresight

Brief Bioinform. 2025 Jul 2;26(4):bbaf327. doi: 10.1093/bib/bbaf327.

Abstract

Cytotoxic T lymphocytes (CTLs) play a key role in the defense of cancer and infectious diseases. CTLs are mainly activated by T cell receptors (TCRs) after recognizing the peptide-bound class I major histocompatibility complex, and subsequently kill virus-infected cells and tumor cells. Therefore, identification of antigen-specific CTLs and their TCRs is a promising agent for T-cell based intervention. Currently, the experimental identification and validation of antigen-specific CTLs is well-used but extremely resource-intensive. The machine learning methods for TCR-pMHC prediction are growing interest particularly with advances in single-cell technologies. This review clarifies the key biological processes involved in TCR-pMHC binding. After comprehensively comparing the advantages and disadvantages of several state-of-the-art machine learning algorithms for TCR-pMHC prediction, we point out the discrepancies with these machine learning methods under specific disease conditions. Finally, we proposed a roadmap of TCR-pMHC prediction. This roadmap would enable more accurate TCR-pMHC binding prediction when improving data quality, encoding and embedding methods, training models, and application context. This review could facilitate the development of T-cell based vaccines and therapy.

Keywords: TCR-pMHC; data quality; deep learning; encoding; prediction.

Publication types

  • Review

MeSH terms

  • Humans
  • Machine Learning*
  • Major Histocompatibility Complex*
  • Peptides* / immunology
  • Peptides* / metabolism
  • Protein Binding
  • Receptors, Antigen, T-Cell* / immunology
  • Receptors, Antigen, T-Cell* / metabolism
  • T-Lymphocytes, Cytotoxic / immunology
  • T-Lymphocytes, Cytotoxic / metabolism

Substances

  • Receptors, Antigen, T-Cell
  • Peptides