The Integration and Growth of R in Soil Research: A 10-Year Analysis

Ecol Evol. 2025 Jun 26;15(7):e71545. doi: 10.1002/ece3.71545. eCollection 2025 Jul.

Abstract

The field of soil science has seen significant advancements in recent years, largely due to the integration of computational tools and statistical methods. Among these resources, the programming language R has emerged as a powerful and versatile platform for soil scientists, aiding in a spectrum of tasks from data analysis and modeling to visualization. Nonetheless, the broader trends and specific patterns of R usage in soil research have not been thoroughly documented. Our study investigated the prevalence of R and its package usage in 25,888 research articles published in 10 leading soil science journals over a decade, from 2014 to 2023. A considerable number of these articles, 7899 (or 30.5%), named R as their primary data analysis tool. The use of R has followed a steady linear growth pattern, rising from 13.9% in 2014 to 46.5% in 2023. The most commonly used R packages were "vegan," "ggplot2," "lme4," "nlme," and "randomForest," with each journal showcasing unique research focuses, resulting in varying frequencies of R package applications across different publications. Furthermore, there was a notable increase in the average number of R packages used per article throughout the study period. This research highlights the pivotal role of R, armed with its robust statistical and visualization capabilities, in enabling soil scientists to conduct comprehensive analyses and gain in-depth insights into the complex dimensions of soil science.

Keywords: R packages; R software; soil ecology; soil journal; soil science.