Accurate estimation of loss given default is necessary to estimating credit risk. Due to the bi-modal nature of LGD, the two-step LGD estimation model is a promising method for LGD estimation. This study improves the first model in the two-step LGD estimation model using probability machines (random forest, k-nearest neighbors, bagged nearest neighbors, and support vector machines). Furthermore, we compare the predictive performance of each model with traditional logistic regression models. This study confirms that random forest is the best model for developing the first model in the two-step LGD estimation model.
ASJC Scopus subject areas