%0 Journal Article %T Loan Default Prediction on Large Imbalanced Data Using Random Forests %A Lifeng Zhou %A Hong Wang %J TELKOMNIKA : Indonesian Journal of Electrical Engineering %D 2012 %I Institute of Advanced Engineering and Science %R 10.11591/telkomnika.v10i6.1323 %X In this paper, we propose an improved random forest algorithm which allocates weights to decision trees in the forest during tree aggregation for prediction and their weights are easily calculated based on out-of-bag errors in training. We compare the performance of our proposed algorithm and the original one on loan default prediction datasets. We also use these two algorithms to create two kinds of balanced random forests to deal with imbalanced data problem. Experiments results show that our proposed algorithm beats the original random forest in terms of both balanced and overall accuracy metrics. Experiments also show that parallel random forests can greatly improve random forests¡¯ efficiency during the learning process. %U http://www.iaesjournal.com/online/index.php/TELKOMNIKA/article/view/1323