Enhancer-driven gene regulatory networks inference from single-cell RNA-seq and ATAC-seq data

Brief Bioinform. 2024 Jul 25;25(5):bbae369. doi: 10.1093/bib/bbae369.

Abstract

Deciphering the intricate relationships between transcription factors (TFs), enhancers, and genes through the inference of enhancer-driven gene regulatory networks (eGRNs) is crucial in understanding gene regulatory programs in a complex biological system. This study introduces STREAM, a novel method that leverages a Steiner forest problem model, a hybrid biclustering pipeline, and submodular optimization to infer eGRNs from jointly profiled single-cell transcriptome and chromatin accessibility data. Compared to existing methods, STREAM demonstrates enhanced performance in terms of TF recovery, TF-enhancer linkage prediction, and enhancer-gene relation discovery. Application of STREAM to an Alzheimer's disease dataset and a diffuse small lymphocytic lymphoma dataset reveals its ability to identify TF-enhancer-gene relations associated with pseudotime, as well as key TF-enhancer-gene relations and TF cooperation underlying tumor cells.

Keywords: Steiner forest problem model; biological network; data integration; scATAC-seq; scRNA-seq; submodular optimization.

MeSH terms

  • Algorithms
  • Alzheimer Disease / genetics
  • Alzheimer Disease / metabolism
  • Chromatin Immunoprecipitation Sequencing
  • Computational Biology / methods
  • Enhancer Elements, Genetic*
  • Gene Regulatory Networks*
  • Humans
  • RNA-Seq*
  • Single-Cell Analysis* / methods
  • Single-Cell Gene Expression Analysis
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • Transcription Factors