in

Optimizing environmental covariates for digital mapping of soil organic carbon in Hunan Province, China


Abstract

Soil organic carbon (SOC) is widely recognized as a fundamental indicator of soil fertility, ecosystem functioning, and overall soil health. Effective land management requires continuous monitoring of SOC variations through modern technological approaches. In this study, 477 soil samples were meticulously collected and analyzed for SOC content in the laboratory. Terrain attributes and spectral indices were then derived from satellite data. Machine learning models, including support vector machine (SVM), artificial neural network (ANN), and random forest (RF), were employed to predict SOC content. To improve computational efficiency and model accuracy, the variance inflation factor (VIF) and Boruta’s variable selection methods were applied, identifying the most relevant environmental covariates. Results demonstrated that only 5 out of 40 environmental covariates were optimal for SOC modeling. Using these selected covariates, the RF model achieved the highest prediction accuracy (R² = 0.84, RMSE = 0.069%, and PRD = 3.6%). The RF model effectively captured the inherent variability and complexity of soil properties, yielding precise and reliable SOC predictions. The results emphasize the capability of machine learning in predicting SOC levels, aiding in the enhancement of soil management strategies and agricultural planning. Ultimately, this study provides a foundation for integrating advanced predictive techniques to enhance SOC assessment in future study.

Similar content being viewed by others

Enhancing digital mapping of soil organic carbon through spatial modeling and validation

Geospatial digital mapping of soil organic carbon using machine learning and geostatistical methods in different land uses

Environmental variables improve the accuracy of remote sensing estimation of soil organic carbon content

Acknowledgements

This study was supported by the Forestry Science and Technology Research and Innovation Project of Hunan Province, China [NO. XLK202435] and Hunan Province Post-Graduate Research Innovation Project [NO. CX20230776]. The authors of this article would like to thank from them for accepting all expenses of this study.

Funding

(1) 2024 Forestry Science and Technology Research and Innovation Project of Hunan Province, China [NO. XLK202435]. (2) 2023 Hunan Province Post-Graduate Research Innovation Project [NO. CX20230776].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to
Zhengui Cai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethics approval

All authors have read, understood, and have complied as applicable with the statement on “Ethical responsibilities of Authors” as found in the Instructions for Authors and are aware that with minor exceptions, no changes can be made to authorship once the paper is submitted.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Cai, X., Liu, F. & Cai, Z. Optimizing environmental covariates for digital mapping of soil organic carbon in Hunan Province, China.
Sci Rep (2026). https://doi.org/10.1038/s41598-026-56073-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41598-026-56073-9

Keywords

  • Boruta’s variable selection
  • Environmental covariates
  • Machine learning models
  • Variance inflation factor


Source: Ecology - nature.com

Dual effect of global urban trees on PM2.5 and associated health burden

Genomic and phenotypic insights into Pseudocolwellia antarctica sp. nov., a novel psychrotolerant bacterium with symbiotic potential from Antarctic zooplankton

Back to Top