Scalable sequential alternating proximal methods for sparse structural SVMs and CRFs


Autoria(s): Balamurugan, P; Shevade, Shirish; Babu, Ravindra T
Data(s)

2014

Resumo

Structural Support Vector Machines (SSVMs) and Conditional Random Fields (CRFs) are popular discriminative methods used for classifying structured and complex objects like parse trees, image segments and part-of-speech tags. The datasets involved are very large dimensional, and the models designed using typical training algorithms for SSVMs and CRFs are non-sparse. This non-sparse nature of models results in slow inference. Thus, there is a need to devise new algorithms for sparse SSVM and CRF classifier design. Use of elastic net and L1-regularizer has already been explored for solving primal CRF and SSVM problems, respectively, to design sparse classifiers. In this work, we focus on dual elastic net regularized SSVM and CRF. By exploiting the weakly coupled structure of these convex programming problems, we propose a new sequential alternating proximal (SAP) algorithm to solve these dual problems. This algorithm works by sequentially visiting each training set example and solving a simple subproblem restricted to a small subset of variables associated with that example. Numerical experiments on various benchmark sequence labeling datasets demonstrate that the proposed algorithm scales well. Further, the classifiers designed are sparser than those designed by solving the respective primal problems and demonstrate comparable generalization performance. Thus, the proposed SAP algorithm is a useful alternative for sparse SSVM and CRF classifier design.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/48846/1/kno_inf_sys_38-3_599_2014.pdf

Balamurugan, P and Shevade, Shirish and Babu, Ravindra T (2014) Scalable sequential alternating proximal methods for sparse structural SVMs and CRFs. In: KNOWLEDGE AND INFORMATION SYSTEMS, 38 (3). pp. 599-621.

Publicador

SPRINGER LONDON LTD

Relação

http://dx.doi.org/10.1007/s10115-013-0681-3

http://eprints.iisc.ernet.in/48846/

Palavras-Chave #Computer Science & Automation (Formerly, School of Automation)
Tipo

Journal Article

PeerReviewed