AI-Driven Transcriptome Prediction in Human Pathology: From Molecular Insights to Clinical Applications

Biology (Basel). 2025 Jun 4;14(6):651. doi: 10.3390/biology14060651.

Abstract

Gene expression regulation underpins cellular function and disease progression, yet its complexity and the limitations of conventional detection methods hinder clinical translation. In this review, we define "predict" as the AI-driven inference of gene expression levels and regulatory mechanisms from non-invasive multimodal data (e.g., histopathology images, genomic sequences, and electronic health records) instead of direct molecular assays. We systematically examine and analyze the current approaches for predicting gene expression and diagnosing diseases, highlighting their respective advantages and limitations. Machine learning algorithms and deep learning models excel in extracting meaningful features from diverse biomedical modalities, enabling tools like PathChat and Prov-GigaPath to improve cancer subtyping, therapy response prediction, and biomarker discovery. Despite significant progress, persistent challenges-such as data heterogeneity, noise, and ethical issues including privacy and algorithmic bias-still limit broad clinical adoption. Emerging solutions like cross-modal pretraining frameworks, federated learning, and fairness-aware model design aim to overcome these barriers. Case studies in precision oncology illustrate AI's ability to decode tumor ecosystems and predict treatment outcomes. By harmonizing multimodal data and advancing ethical AI practices, this field holds immense potential to propel personalized medicine forward, although further innovation is needed to address the issues of scalability, interpretability, and equitable deployment.

Keywords: artificial intelligence; deep learning; multimodal data fusion; precision medicine; transcriptome prediction.

Publication types

  • Review