Developing a Hybrid ARIMA-XGBOOST Model for Analysing Mobile Money Transaction Data in Kenya

Stephen Omondi Odhiambo *

Department of Mathematics and Actuarial Science, The Catholic University of Eastern Africa, Nairobi Kenya.

Nyakundi O. Cornelious

Department of Mathematics and Actuarial Science, Maasai Mara University, Kenya.

Hellen Waititu

Department of Mathematics and Actuarial Science, The Catholic University of Eastern Africa, Nairobi, Kenya.

*Author to whom correspondence should be addressed.


Abstract

This study formulated a hybrid ARIMA-XGBOOST model using the dataset from the Central Bank of Kenya and the objective was to formulate a Hybrid ARIMA-XGBOOST models to capture the patterns and dynamics in Mobile Money Transactions in Kenya. The study was motivated by the gaps that existed from the previous studies which lacked the ability to model both linear and non-linear trends in the data across time period. Since 2007 when Mpesa was invented, the rate of Mobile Money transactions has rose steadily and this required a complex model that will provide a clear trend. To accurately model the dynamics of Mobile Money Transactions in Kenya, this study presented a complex Hybrid model. The model was formulated by combining Autoregressive Integrated Moving Averages (ARIMA) and Extreme Gradient Boosting (XGBOOST). The ARIMA captures the linear trends of the data while the XGBOOST models the non-linear part and trained to model the ARIMA residuals. The model parameters were evaluated using Mean Absolute Error, Root Mean Square Error among others and confirmed to be accurate and reliable. Based on the findings from the ADF coefficient, the stationary condition was met (P-value=0.01), and therefore we proceeded to develop the ARIMA models. Initial diagnoses included model identification and examination of autocorrelation to determine the ARIMA configurations, whereas the Box-Jenkins test confirmed the models' adequacy (P-value=2.220e-16). Based on Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and Mean Percentage Error (MPE), which are all below 1 percent, indicates that the performance of the models had high prediction accuracy.

Keywords: Mpesa, mobile money transaction, cash in cash out in volume, value, ARIMA, XGBOOST


How to Cite

Odhiambo, Stephen Omondi, Nyakundi O. Cornelious, and Hellen Waititu. 2024. “Developing a Hybrid ARIMA-XGBOOST Model for Analysing Mobile Money Transaction Data in Kenya”. Asian Journal of Probability and Statistics 26 (10):108-26. https://doi.org/10.9734/ajpas/2024/v26i10662.