A roadmap for T cell receptor-peptide-bound major histocompatibility complex binding prediction by machine learning: glimpse and foresight

Furong Qi; Qiang Huang; Yao Xuan; Yingyin Cao; Yunyun Shen; Yihan Ren; Zhe Liu; Zheng Zhang

doi:10.1093/bib/bbaf327

Brief Bioinform. 2025 Jul 2;26(4):bbaf327. doi: 10.1093/bib/bbaf327.

Authors

Furong Qi¹, Qiang Huang¹, Yao Xuan¹, Yingyin Cao¹, Yunyun Shen¹, Yihan Ren¹, Zhe Liu¹, Zheng Zhang¹

Affiliation

¹ Institute for Hepatology, National Clinical Research Center for Infectious Disease, Shenzhen Third People's Hospital, The Second Affiliated Hospital, School of Medicine, Southern University of Science and Technology, Shenzhen 518112, Guangdong Province, China.

PMID: 40652935
DOI: 10.1093/bib/bbaf327

Abstract

Cytotoxic T lymphocytes (CTLs) play a key role in the defense against cancer and infectious diseases. CTLs are activated mainly when their T cell receptors (TCRs) recognize the peptide-bound class I major histocompatibility complex (pMHC), and they subsequently kill virus-infected cells and tumor cells. Therefore, identifying antigen-specific CTLs and their TCRs is a promising strategy for T-cell based intervention. Currently, experimental identification and validation of antigen-specific CTLs is widely used but extremely resource-intensive. Machine learning methods for TCR-pMHC binding prediction are attracting growing interest, particularly with advances in single-cell technologies. This review clarifies the key biological processes involved in TCR-pMHC binding. After comprehensively comparing the advantages and disadvantages of several state-of-the-art machine learning algorithms for TCR-pMHC prediction, we point out the discrepancies among these machine learning methods under specific disease conditions. Finally, we propose a roadmap for TCR-pMHC prediction. This roadmap would enable more accurate TCR-pMHC binding prediction by improving data quality, encoding and embedding methods, training models, and application context. This review could facilitate the development of T-cell based vaccines and therapies.

Keywords: TCR-pMHC; data quality; deep learning; encoding; prediction.

Publication types

Review

MeSH terms

Humans
Machine Learning*
Major Histocompatibility Complex*
Peptides* / immunology
Peptides* / metabolism
Protein Binding
Receptors, Antigen, T-Cell* / immunology
Receptors, Antigen, T-Cell* / metabolism
T-Lymphocytes, Cytotoxic / immunology
T-Lymphocytes, Cytotoxic / metabolism

Substances

Receptors, Antigen, T-Cell
Peptides
