Model selection and error estimation

Author(s): Bartlett, P. L.; Boucheron, S.; Lugosi, G.
Date(s)

2002

Abstract

We study model selection strategies based on penalized empirical loss minimization. We point out a tight relationship between error estimation and data-based complexity penalization: any good error estimate may be converted into a data-based penalty function, and the performance of the resulting model selection method is governed by the quality of the error estimate. We consider several penalty functions, involving error estimates on independent test data, empirical VC dimension, empirical VC entropy, and margin-based quantities. We also consider the maximal difference between the error on the first half of the training data and the error on the second half, as well as the expected maximal discrepancy, a closely related capacity estimate that can be calculated by Monte Carlo integration. Maximal-discrepancy penalty functions are appealing for pattern classification problems, since computing them is equivalent to empirical risk minimization over the training data with some labels flipped.
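
As a concrete illustration of the abstract's last sentence, here is a minimal sketch (our own, not code from the paper) of the maximal-discrepancy penalty for binary classification with 0-1 loss. Flipping the labels on the first half of the sample turns the discrepancy maximization into a single empirical risk minimization, since max_f discrepancy(f) = 1 - 2 min_f (0-1 error rate of f on the flipped sample). The one-dimensional decision-stump class and all names below are illustrative assumptions.

```python
# Minimal sketch of the maximal-discrepancy penalty (illustration only,
# not the authors' code). Hypothesis class: 1-D decision stumps.
import numpy as np


def min_stump_error(x, y):
    """Smallest number of 0-1 errors achievable on (x, y), y in {0, 1},
    by a threshold stump "predict 1 iff x > t" (either orientation)."""
    best = len(y)
    for t in np.concatenate(([-np.inf], np.sort(x))):
        err = np.sum((x > t).astype(int) != y)  # stump predicting 1 above t
        best = min(best, err, len(y) - err)     # ...and its complement
    return best


def maximal_discrepancy(x, y):
    """Estimate max_f (2/n) * [sum of losses on the first half
    minus sum of losses on the second half] via empirical risk
    minimization on the sample with first-half labels flipped."""
    n = len(y)
    assert n % 2 == 0, "use an even sample size for two equal halves"
    y_flipped = y.copy()
    y_flipped[: n // 2] = 1 - y_flipped[: n // 2]  # flip first half
    return 1.0 - 2.0 * min_stump_error(x, y_flipped) / n


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=200)
    y = (x + 0.5 * rng.normal(size=200) > 0).astype(int)
    print("maximal-discrepancy penalty:", maximal_discrepancy(x, y))
```

The expected maximal discrepancy mentioned in the abstract can then be approximated by averaging such penalties over many random re-partitions of the sample, i.e., by Monte Carlo integration.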

Identifier

http://eprints.qut.edu.au/43932/

Publisher

Kluwer

Relation

DOI: 10.1023/A:1013999503812

Bartlett, P. L., Boucheron, S., & Lugosi, G. (2002) Model selection and error estimation. Machine Learning, 48(1-3), pp. 85-113.

Rights

Springer

Source

Faculty of Science and Technology; Mathematical Sciences

Keywords

010200 APPLIED MATHEMATICS; 080100 ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING; 170200 COGNITIVE SCIENCE; Concentration inequalities; Empirical penalties; Model selection; Penalization; Calculations; Computational complexity; Error analysis; Estimation; Integration; Mathematical models; Monte Carlo methods; Pattern recognition; Error estimation; Learning systems

Type

Journal Article