Protocol for obtaining cancer type and subtype predictions using subSCOPE

STAR Protoc. 2025 Jun 20;6(2):103705. doi: 10.1016/j.xpro.2025.103705. Epub 2025 Apr 10.

Abstract

We present a protocol for obtaining cancer type and subtype predictions using a machine learning method (subSCOPE). We describe steps for data preparation, subSCOPE setup, and running subSCOPE inference on prepared data. The protocol supports five -omics data types as input (DNA methylation, gene expression, microRNA [miRNA] expression, point mutations, and copy-number variants) and allows individual cancer type and data type selection. For non-The Cancer Genome Atlas (TCGA) cancer samples, it provides subtype-level classification across 26 different TCGA cancer cohorts and 106 subtypes. For complete details on the use and execution of this protocol, please refer to Ellrott et al.1.

Keywords: Bioinformatics; Cancer; Computer sciences; Genomics.

MeSH terms

  • Computational Biology* / methods
  • DNA Copy Number Variations / genetics
  • DNA Methylation / genetics
  • Humans
  • Machine Learning*
  • MicroRNAs / genetics
  • Neoplasms* / classification
  • Neoplasms* / diagnosis
  • Neoplasms* / genetics

Substances

  • MicroRNAs