| Issue |
E3S Web Conf.
Volume 702, 2026
Second International Conference on Innovations in Sustainable and Digital Construction Practices (ISDCP 2026)
|
|
|---|---|---|
| Article Number | 02007 | |
| Number of page(s) | 13 | |
| Section | Environmental Engineering | |
| DOI | https://doi.org/10.1051/e3sconf/202670202007 | |
| Published online | 01 April 2026 | |
Machine Learning Models for the Prediction of Water Quality Index Using Physicochemical Characteristics of Groundwater
1 Department of Engineering and Technology, College of Engineering, University of Technology and Applied Sciences, Muscat, Oman., Oman.
2 Department of Civil Engineering, Dayananda Sagar College of Engineering, Bengaluru - 560 111, Karnataka, India
3 Civil Engineering Department, Al-Qalam University, Kirkuk, Iraq
4 Civil Engineering Department, Tishk International University, Sulaimani, Iraq. Email: This email address is being protected from spambots. You need JavaScript enabled to view it.
, ORCID: 0000-0002-8225-1721
5 Department of Civil Engineering, University College of Engineering, Thirukkuvalai, Nagapattinam, Tamil Nadu, India
* Corresponding author: This email address is being protected from spambots. You need JavaScript enabled to view it.
Abstract
This research examines the capacity of machine learning (ML) methods to predict the Water Quality Index (WQI) of groundwater from a set of physicochemical parameters. A dataset of 409 groundwater samples was examined, which includes pH, EC (electrical conductivity), TH (total hardness), Ca (calcium), Mg (magnesium), Na (sodium), K (potassium), HCO3 (bicarbonate), Cl (chloride), SO4 (sulfate), and NO3 (nitrate). The WQI was calculated and employed as the target variable in this analysis. Four regression models (Decision Tree Regressor, Random Forest Regressor, Gradient Boosting Regressor, XGBoost Regressor) were evaluated using statistical metrics. The Gradient Boosting Regressor demonstrated superior predictive capability among the four tested models, with an R2 = 0.8982, MAE = 9.2954, and MSE = 1361.6065. XGBoost and Random Forest models also have high predictive capacity; however, the Decision Tree model has relatively low accuracy compared to the other three models. The results suggest that ML algorithms can identify non-linear associations between groundwater quality parameters and WQI. This research shows that data-driven approaches can provide rapid and reliable predictions of groundwater quality, which may help reduce the need for extensive laboratory testing and promote better utilisation of water resources through more effective and efficient water resource management.
Publisher note: The letter "l" was missing at the end of author’s name "Vimal Arokiaraj George Gabriel". The article was corrected on 3 April 2026.
© The Authors, published by EDP Sciences, 2026
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

