Biblioteca Digital

On the impact of disproportional samples in credit scoring models: An application to a Brazilian bank data

**Autoria(s):** Louzada, Francisco; Silva, Paulo Henrique Ferreira da; Diniz, Carlos Alberto Ribeiro
Contribuinte(s)	UNIVERSIDADE DE SÃO PAULO
Data(s)	21/10/2013 21/10/2013 2012
Resumo	Statistical methods have been widely employed to assess the capabilities of credit scoring classification models in order to reduce the risk of wrong decisions when granting credit facilities to clients. The predictive quality of a classification model can be evaluated based on measures such as sensitivity, specificity, predictive values, accuracy, correlation coefficients and information theoretical measures, such as relative entropy and mutual information. In this paper we analyze the performance of a naive logistic regression model (Hosmer & Lemeshow, 1989) and a logistic regression with state-dependent sample selection model (Cramer, 2004) applied to simulated data. Also, as a case study, the methodology is illustrated on a data set extracted from a Brazilian bank portfolio. Our simulation results so far revealed that there is no statistically significant difference in terms of predictive capacity between the naive logistic regression models and the logistic regression with state-dependent sample selection models. However, there is strong difference between the distributions of the estimated default probabilities from these two statistical modeling techniques, with the naive logistic regression models always underestimating such probabilities, particularly in the presence of balanced samples. (C) 2012 Elsevier Ltd. All rights reserved.
Identificador	Expert Systems With Applications, Oxford, v. 39, n. 9, supl. 1, Part 1, p. 8071-8078, Jul, 2012 0957-4174 http://www.producao.usp.br/handle/BDPI/35276 10.1016/j.eswa.2012.01.134 http://dx.doi.org/10.1016/j.eswa.2012.01.134
Idioma(s)	eng
Publicador	PERGAMON-ELSEVIER SCIENCE LTD OXFORD
Relação	Expert Systems With Applications
Direitos	closedAccess Copyright PERGAMON-ELSEVIER SCIENCE LTD
Palavras-Chave	#CLASSIFICATION MODELS #NAIVE LOGISTIC REGRESSION #LOGISTIC REGRESSION WITH STATE-DEPENDENT SAMPLE SELECTION #PERFORMANCE MEASURES #CREDIT SCORING #PROTEIN SECONDARY STRUCTURE #STRUCTURE PREDICTION #NEURAL-NETWORKS #CLASSIFICATION #ACCURACY #ALGORITHMS #LOANS #COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE #ENGINEERING, ELECTRICAL & ELECTRONIC #OPERATIONS RESEARCH & MANAGEMENT SCIENCE
Tipo	article original article publishedVersion

Acesso ao item digital