Early warning of bond default risk is to predict the future bond default status based on the enter-prise's financial factors,non-financial factors,and external macro factors.For different combinations of varia-bles,the effect of default prediction is different;there must be an optimal combination of indicators,which can minimize the error of default prediction.For different thresholds,the effect of default prediction is differ-ent,and there is bound to be an optimal threshold of default judgment,which can distinguish between the bonds that default from those that do not to the greatest extent.This paper uses the random forest model to se-lect the feature combination,and studies the default risk of bonds based on Logit model.The first contribution of this paper is in the optimal feature selection.Under the premise of the minimum Type-Ⅱ Error,an optimal random forest is obtained by maximizing the AUC for different numbers of decision trees.According to the ranking of the importance of indicators,the optimal feature combination can be obtained by forward selection to maximize AUC.The second contribution is in the determination of the optimal default judgment threshold.Taking the minimum weighted sum of the Type-Ⅰ Error and Type-Ⅱ Error as the objective function,the optimal threshold of logical regression is deduced.The third is the prediction accuracy of this model is higher than popular big data prediction models.Based on data from bonds issued by Chinese bonds listed companies from 2014 to 2018,the empirical research shows that the key indicators affecting China's medium and short-term default prediction are:Monetary capital/short-term debt,net profit,the number of bonds issued by issuers,industry prosperity index,and industry entrepreneur confidence index.The key indicators affecting short-term default prediction are:Monetary assets,quick ratio,fixed asset investment price index,and money supply M0.The key indicators that have an impact on the medium-term default forecast are registered capital of the is-suer,repayment amount of bonds at maturity,and bond maturity index.
bond defaultdefault predictionfeature selectionoptimal thresholdrandom forests