Abstract
In recent years, statistical and data-driven modeling approaches have been increasingly employed to predict element concentrations and to examine relationships among environmental features. In this context, the integration of feature selection techniques with machine learning models enhances model generalization and reduces model complexity by enabling the identification of key elements that are strongly associated with the target feature. This study applies machine learning models to investigate the relationships between Aluminum (Al) and other elements and to predict Al concentration levels in an inland marine ecosystem. Specifically, the study evaluates whether accurate predictions can be achieved using a reduced subset of informative elements rather than the full feature set. The findings demonstrate that machine learning methods, when combined with feature selection, can successfully predict Al concentrations while yielding more interpretable models based on a limited number of significant elements.
Data Availability
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
References
Arslan, H., Agir, A. & Demir, G. Impacts of PM10 exposure on hospitalization for acute bronchitis in Ankara, Türkiye. Front. Life Sci. Relat. Technol. 5, 1–5 (2024).
Gjergjizi Nallbani, B. et al. Utilizing a medicinal plant and soil assays for heavy metal pollution assessment and its impacts in a Turkish Industrial Zone. Spectrosc. Lett. 1–18 (2024).
Banaee, M., Zeidi, A., Mikušková, N. & Faggio, C. Assessing metal toxicity on crustaceans in aquatic ecosystems: a comprehensive review. Biol. Trace Element Res. 1–19 (2024).
Fulke, A. B., Ratanpal, S. & Sonker, S. Understanding heavy metal toxicity: Implications on human health, marine ecosystems and bioremediation strategies. Mar. Pollut. Bull. 206, 116707 (2024).
Jean-Lavenir, N. M. et al. Potentially toxic metals contamination in stream sediments of MBAL area (Pan-African fold belt, Cameroon). Discov. Geosci. 2, 15 (2024).
Prabakaran, K., Sompongchaiyakul, P., Bureekul, S., Wang, X. & Charoenpong, C. Heavy metal bioaccumulation and risk assessment in fishery resources from the Gulf of Thailand. Mar. Pollut. Bull. 198, 115864 (2024).
Borrell, A. et al. High aluminum content in bone of marine mammals and its relation with source levels and origin. Environ. Pollut. 331, 121936 (2023).
Yazicioglu, H. et al. Physiological alterations and genotoxic damage under combined aluminum and cadmium treatments in Bryophyllum daigremontianum clones. Mol. Biol. Rep. 51, 1–21 (2024).
Botté, A., Zaidi, M., Guery, J., Fichet, D. & Leignel, V. Aluminium in aquatic environments: Abundance and ecotoxicological impacts. Aquat. Ecol. 56, 751–773 (2022).
Tiwari, A. K., Pal, D. B. & Prasad, N. Agricultural waste biomass utilization in waste water treatment. In Utilization of Waste Biomass in Energy, Environment and Catalysis, 19–41 (CRC Press, 2022).
Islam, M. S. et al. Contamination and ecological risk assessment of Cr, As, Cd and Pb in water and sediment of the southeastern Bay of Bengal coast in a developing country. Mar. Pollut. Bull. 197, 115720 (2023).
Lao, Y., Ma, J., Pan, K., Chen, F. & Zhang, Z. A brief review of effects of aluminum on marine diatoms. Bull. Environ. Contam. Toxicol. 113, 31 (2024).
Sáez-Guinoa, J., García-Franco, E., Llera-Sastresa, E. & Romeo, L. M. The effects of energy consumption of alumina production in the environmental impacts using life cycle assessment. Int. J. Life Cycle Assess. 29, 380–393 (2024).
Santore, R. C. et al. Development and application of a biotic ligand model for predicting the chronic toxicity of dissolved and precipitated aluminum to aquatic organisms. Environ. Toxicol. Chem. 37, 70–79 (2018).
Feng, J., Li, X., Yang, Y. & Zhou, Z. Preparation of low-cost aluminum-loaded longan shell adsorbent for fluoride removal: Experimental and modeling studies. J. Environ. Chem.Eng. 10, 108917 (2022).
Wang, J., Liu, Q., Xu, L., Siddique, M. S. & Yu, W. Impacts of water hardness on coagulation-UF-NF process using aluminum salts. Sep. Purif. Technol. 314, 123611 (2023).
Gantayat, R. R. & Elumalai, V. Salinity-induced changes in heavy metal behavior and mobility in semi-arid coastal aquifers: A comprehensive review. Water 16, 1052 (2024).
Ghani, S. A. A. Trace metals in seawater, sediments and some fish species from Marsa Matrouh Beaches in north-western Mediterranean coast. Egypt. The Egyptian J. Aquat. Res. 41, 145–154 (2015).
Angel, B. M., Apte, S. C., Batley, G. E. & Golding, L. A. Geochemical controls on aluminium concentrations in coastal waters. Environ. Chem. 13, 111–118 (2015).
Abdelaal, A., Abdelkader, A. I., Alshehri, F., Elatiar, A. & Almadani, S. A. Assessment and spatiotemporal variability of heavy metals pollution in water and sediments of a coastal landscape at the Nile Delta. Water 14, 3981 (2022).
Mbandzi-Phorego, N., Puccinelli, E., Pieterse, P. P., Ndaba, J. & Porri, F. Metal bioaccumulation in marine invertebrates and risk assessment in sediments from South African coastal harbours and natural rocky shores. Environ. Pollut. 124230 (2024).
Walton, R. C., McCrohan, C. R., Livens, F. & White, K. N. Trophic transfer of aluminium through an aquatic grazer-omnivore food chain. Aquat. Toxicol. 99, 93–99 (2010).
Hardisson, A., Revert, C., Gonzales-Weler, D. & Rubio, C. Aluminium exposure through the diet. Food Sci. Nutr 3, 19 (2017).
Barnhart, B., Flinders, C., Ragsdale, R., Johnson, G. & Wiegand, P. Deriving human health and aquatic life water quality criteria in the United States for bioaccumulative substances: A historical review and future perspective. Environ. Toxicol. Chem. 40, 2394–2405 (2021).
Alasfar, R. H. & Isaifan, R. J. Aluminum environmental pollution: The silent killer. Environ. Sci. Pollut. Res. 28, 44587–44597 (2021).
Bonfiglio, R., Scimeca, M. & Mauriello, A. The impact of aluminum exposure on human health. Arch. Toxicol. 97, 2997–2998 (2023).
Ergashov, A. The general effect of aluminum on the body. Int. J. Med. Sci. Clin. Res. 3, 90–95 (2023).
Yilmaz, N., Ozyigit, I. I., Dogan, I., Demir, G. & Yalcin, I. E. A case study performed in Küçükçekmece Lagoon channel/Istanbul, Turkey: How the heavy metal contamination and the seasonal variations on phytoplankton composition influence water quality. Desalin. Water Treat. 239, 126–136 (2021).
Trevizani, T. H., Domit, C., Santos, M. C. D. O. & Figueira, R. C. L. Bioaccumulation of heavy metals in estuaries in the southwest Atlantic Ocean. Environ. Sci. Pollut. Res. 30, 26703–26717 (2023).
Yaseen, Z. M. et al. Heavy metals prediction in coastal marine sediments using hybridizedmachine learning models with metaheuristic optimization algorithm. Chemosphere 352, 141329 (2024).
Deng, D., Wu, Y., Ren, B. & Yin, H. Applying Chemical and Statistical Analysis Methods to Evaluate Water and Stream Sediments around the Coal Mine Area in Dazhu China.. Water 15, 1421 (2023).
Bhagat, S. K. et al. Prediction of sediment heavy metal at the Australian Bays using newly developed hybrid artificial intelligence models. Environ. Pollut. 268, 115663 (2021).
Mert, B. K. & Kasapogullari, D. A case study of using artificial neural networks to predict heavy metal pollution in Lake Iznik. Environ. Monit. Assess. 196, 1–26 (2024).
Yolcu, U., Yalcin, I. E., Uras, M. E. & Ozyigit, I. I. Modeling the effects of essential heavy metals on environmental pollution: A linear and nonlinear prediction model via cascade forward-neural network. Math. Methods Appl. Sci. 47, 4306–4318 (2024).
Chen, D. et al. Delineating and identifying risk zones of soil heavy metal pollution in an industrialized region using machine learning. Environ. Pollut. 318, 120932 (2023).
Cui, S. et al. Advances and applications of machine learning and deep learning in environmental ecology and health. Environ. Pollut. 122358 (2023).
Gryech, I., Ghogho, M., Mahraoui, C. & Kobbane, A. An exploration of features impacting respiratory diseases in urban areas. Int. J. Environ. Res. Public Health 19, 3095 (2022).
Gryech, I., Asaad, C., Ghogho, M. & Kobbane, A. Applications of machine learning & Internet of Things for outdoor air pollution monitoring and prediction: A systematic literature review. Eng. Appl. Artif. Intell. 137, 109182 (2024).
Xu, X., Lai, T., Jahan, S. & Farid, F. Water and Sediment Analyse Using Predictive Models. arXiv preprint arXiv:2203.03422 (2022).
Nguyen, H. N. et al. Predicting element concentrations by machine learning models in neutron activation analysis. J. Radioanal. Nucl. Chem. 333, 1759–1768 (2024).
Han, Z. et al. Predicting and investigating water quality index by robust machine learning methods. J. Environ. Manag. 381, 125156 (2025).
Chen, X. et al. Water quality parameters-based prediction of dissolved oxygen in estuaries using advanced explainable ensemble machine learning. J. Environ. Manag. 380, 125146 (2025).
Dai, H., Huang, G., Zeng, H. & Yang, F. PM2.5 concentration prediction based on spatiotemporal feature selection using XGBoost-MSCNN-GA-LSTM. Sustainability 13, 12071 (2021).
Zhang, B., Zhang, Y. & Jiang, X. Feature selection for global tropospheric ozone prediction based on the BO-XGBoost-RFE algorithm. Sci. Rep. 12, 9244 (2022).
Demir, S. & Sahin, E. K. Liquefaction prediction with robust machine learning algorithms (SVM, RF, and XGBoost) supported by genetic algorithm-based feature selection and parameter optimization from the perspective of data processing. Environ. Earth Sci. 81, 459 (2022).
Garabaghi, F. H., Benzer, S. & Benzer, R. Modeling dissolved oxygen concentration using machine learning techniques with dimensionality reduction approach. Environ. Monit. Assess. 195, 879 (2023).
Ahajjam, A. et al. Enhancing prediction of wildfire occurrence and behavior in Alaska using spatio-temporal clustering and ensemble machine learning. Ecol. Info. 85, 102963 (2025).
Theng, D. & Bhoyar, K. K. Feature selection techniques for machine learning: A survey of more than two decades of research. Knowl. Inf. Syst. 66, 1575–1637 (2024).
Kohavi, R. & John, G. H. Wrappers for feature subset selection. Artif. Intell. 97, 273–324 (1997).
Guyon, I., Weston, J., Barnhill, S. & Vapnik, V. Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422 (2002).
Kuhn, M. et al. Package ‘caret’. R J. 223, 48 (2020).
Holland, J. Adaptation in Natural and Artificial Systems, lstedn (1975).
Eiben, A. E. & Smith, J. E. Introduction to evolutionary computing (Springer, 2015).
Wolpert, D. H. Stacked generalization. Neural Netw. 5, 241–259 (1992).
Seber, G. A. & Lee, A. J. Linear regression analysis (John Wiley & Sons, 2012).
Turksen, I. B. Fuzzy functions with LSE. Appl. Soft Comput. 8, 1178–1188 (2008).
Aladag, C. H., Turksen, I. B., Dalar, A. Z., Egrioglu, E. & Yolcu, U. Application of type-1fuzzy functions approach for time series forecasting. Turk. J. Fuzzy Syst. 5, 1–9 (2014).
Tak, N. & Inan, D. Type-1 fuzzy forecasting functions with elastic net regularization. Expert Syst. Appl. 199, 116916 (2022).
Cebeci, Z. et al. Package ‘ppclust’ (2019).
Hans, C. Elastic net regression modeling with the orthant normal prior. J. Am. Stat. Assoc. 106, 1383–1393 (2011).
Kelly, J. W., Degenhart, A. D., Siewiorek, D. P., Smailagic, A. & Wang, W. Sparse linear regression with elastic net regularization for brain-computer interfaces. In 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 4275–4278 (IEEE, 2012).
Friedman, J. et al. Package ‘glmnet’. CRAN R Repositary 595 (2021).
Friedman, J. H. Greedy function approximation: a gradient boosting machine. Annals of statistics 1189–1232 (2001).
Chen, T. & Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 785–794 (2016).
Qiu, Y. et al. Performance evaluation of hybrid WOA-XGBoost, GWO-XGBoost and BO-XGBoost models to predict blast-induced ground vibration. Eng. Comput. 38, 4145–4162 (2022).
Chen, T., He, T., Benesty, M. & Khotilovich, V. Package ‘xgboost’. R Version 90, 40 (2019).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
González-Sopeña, J., Pakrashi, V. & Ghosh, B. An overview of performance evaluation metrics for short-term statistical wind power forecasting. Renew. Sustain. Energy Rev. 138, 110515 (2021).
Bat, L., Şahin, F., Öztekin, A., Özsandıkçı, U. & Özkan, E. Y. Trace elements pollution in surface sediment of the sea of marmara coastal and transition water. Mar. Pollut. Bull. 218, 118067 (2025).
Pourang, N. et al. Spatiotemporal patterns, source apportionment, and ecological risk of major and trace elements in sediment cores from anzali international wetland. Sci. Rep. 15, 41965 (2025).
Li, J. Natural and anthropogenic controls on heavy metal distribution in east China sea sediments. Front. Mar. Sci. 12, 1689901 (2025).
Li, F., Yin, H., Zhu, T. & Zhuang, W. Understanding the role of manganese oxides in retaining harmful metals: Insights into oxidation and adsorption mechanisms at microstructure level. Eco-Environ. Health 3, 89–106 (2024).
Zeng, P. et al. RWKV-SKF: A recurrent architecture with state-space and frequency-domain filtering for dissolved oxygen predicting and revealing influencing mechanisms. Inf. Sci. 123018(2025).
Funding
This study was partially supported by the Scientific and Technological Research Council of Türkiye (TÜBİTAK) under Grant Number 123F266. The authors thank TÜBİTAK for their support.
Funding: Not received for publication fee.
Author information
Authors and Affiliations
Contributions
AU: Formal analysis, software, visualization, writing-original draft, writing-review; NT: Supervision, conceptualization, formal analysis, writing-original draft, writing-review; AHO: Financial support, material preparation, data curation, writing-review; IIO: Financial support, data curation, writing-review, and editing.
Corresponding author
Ethics declarations
Competing interests
On behalf of all authors of the paper, I declare that there is no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Supplementary Information. (download PDF )
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
Reprints and permissions
About this article
Cite this article
Ucan, A., Tak, N., Hocaoglu-Ozyigit, A. et al. Forecasting toxic metal concentrations in an inland sea ecosystem with machine learning algorithms.
Sci Rep (2026). https://doi.org/10.1038/s41598-026-48252-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-48252-5
Source: Ecology - nature.com
