THEORETICAL AND APPLIED CLIMATOLOGY, ( ISI ), Volume (157), No (315), Year (2026-4) , Pages (1-23)

Title : ( Machine learning-driven strategies for wheat yield prediction and climate adaptation in Arid regions: a comparative analysis of random forest and XGBoost )

Authors: Mohsen Jahan , Mehdi Nassiri Mahallati , Soltani , Yaghoubi ,

Citation: BibTeX | EndNote

Abstract

This study evaluates machine learning approaches for predicting irrigated wheat yield in Khorasan Razavi Province, Iran, under varying climatic conditions, addressing the need for robust agricultural forecasting tools. We compared the performance of Random Forest (RF) and eXtreme Gradient Boosting (XGB) algorithms in predicting wheat yield and classifying yield categories (high, medium, low) across 20 counties. Models were trained on 70% of a multi-year dataset, with hyperparameter tuning via Grid Search and five-fold cross-validation, and validated on an independent 30% test set. Performance was assessed using RMSE, R², MAE, and Willmott’s agreement index (d). Yield classification accuracy was evaluated via confusion matrices. The RF model outperformed XGB in yield prediction (RMSE: 395.13 kg ha⁻¹, R²: 0.63 vs. RMSE: 492.21 kg ha⁻¹, R²: 0.56), while XGB showed slightly higher classification accuracy (69.8% vs. 68.9%), with RF being more robust for medium-yield classification. SHAP analysis identified key predictors, including growing season length, minimum daily temperature, and temperature seasonality. Partial Dependence Plots revealed nonlinear variable relationships, such as yield decline above 200 mm precipitation. We conclude that RF is better suited for general yield prediction, whereas XGB excels in classification tasks. Integrating interpretable machine learning with climatic and agronomic data provides a powerful tool for enhancing agricultural decision-making and optimizing wheat yield predictions under climate variability.

Keywords

, Random Forest; XGB; Grid Search; Five, Fold Cross, Validation; SHAP; Partial Dependence
برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

@article{paperid:1107132,
author = {Jahan, Mohsen and Nassiri Mahallati, Mehdi and Mehdi and Fateme},
title = {Machine learning-driven strategies for wheat yield prediction and climate adaptation in Arid regions: a comparative analysis of random forest and XGBoost},
journal = {THEORETICAL AND APPLIED CLIMATOLOGY},
year = {2026},
volume = {157},
number = {315},
month = {April},
issn = {0177-798X},
pages = {1--23},
numpages = {22},
keywords = {Random Forest; XGB; Grid Search; Five-Fold Cross-Validation; SHAP; Partial Dependence},
}

[Download]

%0 Journal Article
%T Machine learning-driven strategies for wheat yield prediction and climate adaptation in Arid regions: a comparative analysis of random forest and XGBoost
%A Jahan, Mohsen
%A Nassiri Mahallati, Mehdi
%A Mehdi
%A Fateme
%J THEORETICAL AND APPLIED CLIMATOLOGY
%@ 0177-798X
%D 2026

[Download]