Multicollinearity Effect in Regression Analysis: A Feed Forward Artificial Neural Network Approach

Main Article Content

C. P. Obite
N. P. Olewuezi
G. U. Ugwuanyim
D. C. Bartholomew


In this study we compared the performance of Ordinary Least Squares Regression (OLSR) and the Artificial Neural Network (ANN) in the presence of multicollinearity using two datasets – a real life insurance data and a simulated data – to know which of the methods, models a highly correlated dataset better using the Root Mean Square Error (RMSE) as the performance measure. The ANN performed better than the OLSR model for all the different ANN models except the models with nine and ten nodes in the hidden layer for the real life data. The network with four hidden nodes was the best model. For the simulated data, the ANN model with two hidden nodes gave us the least RMSE when compared to the OLSR model and the other ANN models in the testing set. The network with two hidden nodes modelled the data very well. In the presence of multicollinearity, ANN model achieves a better fit and forecast than the OLSR.

Multicollinearity, ordinary least squares, artificial neural network, root mean square error.

Article Details

How to Cite
P. Obite, C., P. Olewuezi, N., U. Ugwuanyim, G., & C. Bartholomew, D. (2020). Multicollinearity Effect in Regression Analysis: A Feed Forward Artificial Neural Network Approach. Asian Journal of Probability and Statistics, 6(1), 22-33.
Original Research Article


Wonsuk Y, Robert M, Sejong B, Karan S, Qinghua P, James WL. A study of effects of multicollinearity in the multivariable analysis. International Journal of Applied Science and Technology. 2014;4(5):9-19.

Onoghojobi B, Olewuezi NP, Obite CP. Comparison of linear prediction methods in time series. Journal of the Nigerian Association of Mathematical Physics. 2016;34:151-156.

Olewuezi NP, Onoghojobi B, Bartholomew DC. Estimation of nonorthogornal problem using time series dataset. Journal of the Nigerian Association of Mathematical Physics. 2016;34:141–150.

Yazid MA, Mowafaq MA. A Monte Carlo comparison between ridge and principal components regression methods. Applied Mathematical Sciences. 2009;3(42):2085–2098.

Zou J, Han Y, So SS. Overview of artificial neural networks. Methods Mol Biol. 2008;458:15-23.

Hsiao-Tien Pao. A comparison of neural network and multiple regression analysis in modelling capital structure. Expert Systems with Applications. 2008;35:720-727.

Kumar UA. Comparison of neural networks and regression analysis: A new insight. Expert Systems with Applications. 2005;29(2):424–430.

Ramírez MCV, de Campos Velho HF, Ferreira NJ. Artificial neural network technique for rainfall forecasting applied to the São Paulo region. J. Hydrol. 2005;301:146–162.

Asep S, Agung PR, Asep KS. The application of ANN-linear perceptron in the development of DSS for a fishery industry. Procedia Computer Science. 2015;72:67-77.

Soukayna M, Jan FA. Prediction of SPEI using MLR and ANN: A case study for Wilsons Promontory Station in Victoria. International Conference on Machine Learning and Application. 2015;14:318-324.

Ebiendele EP, Ebiendele EP. Skill comparison of multiple-linear regression model and artificial neural network model in seasonal rainfall prediction-North East Nigeria. Asian Research Journal of Mathematics. 2018;11(2):1-10.

Aysenur G, Elif T, Derya U, Sibel Y, Nurgul A. Prediction of the colorimetric parameters and mass loss of heat-treated bamboo: Comparison of multiple linear regression and artificial neural network method. Colour Research and Application. 2019;44:1-10.

Ilaboya IR, Igbinedion OE. Performance of Multiple Linear Regression (MLR) and Artificial Neural Network (ANN) for the prediction of monthly maximum rainfall in Benin City, Nigeria. International Journal of Engineering Science and Application. 2019;3(1):21-37.