We present a protocol for obtaining cancer type and subtype predictions using a machine learning method (subSCOPE). We describe steps for data preparation, subSCOPE setup, and running subSCOPE inference on prepared data. The protocol supports five -omics data types as input (DNA methylation, gene expression, microRNA [miRNA] expression, point mutations, and copy-number variants) and allows individual cancer type and data type selection. For non-The Cancer Genome Atlas (TCGA) cancer samples, it provides subtype-level classification across 26 different TCGA cancer cohorts and 106 subtypes. For complete details on the use and execution of this protocol, please refer to Ellrott et al.1.
Keywords: Bioinformatics; Cancer; Computer sciences; Genomics.
Copyright © 2025 The Author(s). Published by Elsevier Inc. All rights reserved.