Using random forests to uncover the predictive power of distance-varying cell interactions in tumor microenvironments

PLoS Comput Biol. 2024 Jun 14;20(6):e1011361. doi: 10.1371/journal.pcbi.1011361. eCollection 2024 Jun.

Abstract

Tumor microenvironments (TMEs) contain vast amounts of information on patient's cancer through their cellular composition and the spatial distribution of tumor cells and immune cell populations. Exploring variations in TMEs between patient groups, as well as determining the extent to which this information can predict outcomes such as patient survival or treatment success with emerging immunotherapies, is of great interest. Moreover, in the face of a large number of cell interactions to consider, we often wish to identify specific interactions that are useful in making such predictions. We present an approach to achieve these goals based on summarizing spatial relationships in the TME using spatial K functions, and then applying functional data analysis and random forest models to both predict outcomes of interest and identify important spatial relationships. This approach is shown to be effective in simulation experiments at both identifying important spatial interactions while also controlling the false discovery rate. We further used the proposed approach to interrogate two real data sets of Multiplexed Ion Beam Images of TMEs in triple negative breast cancer and lung cancer patients. The methods proposed are publicly available in a companion R package funkycells.

MeSH terms

  • Algorithms
  • Cell Communication* / physiology
  • Computational Biology / methods
  • Computer Simulation
  • Female
  • Humans
  • Lung Neoplasms / immunology
  • Lung Neoplasms / pathology
  • Models, Biological
  • Neoplasms / immunology
  • Neoplasms / pathology
  • Random Forest
  • Triple Negative Breast Neoplasms / immunology
  • Triple Negative Breast Neoplasms / pathology
  • Tumor Microenvironment* / physiology

Grants and funding

M.L.A.L. is supported by funding from the Viertel Foundation Senior Medical Research Fellowship and the ‘ACRF Program for resolving cancer complexity and therapeutic resistance’, proudly supported by Australian Cancer Research Foundation. G.R. was supported by the Natural Science and Engineering Research Council of Canada, Discover Grant RGPIN-03723. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.