Support vector machines and regularization

Vladimir Cherkassky; Yunqian Ma

Support vector machines and regularization

Electrical and Computer Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Recently, there has been a growing interest in Statistical Learning Theory, aka VC theory, due to many successful applications of Support Vector Machines (SVMs). Even though most theoretical results in VC-theory (including all main concepts underlying SVM methodology) have been developed over 25 years ago, these concepts are occasionally misunderstood in the research community. This paper compares standard SVM regression and the regularization for learning dependencies from data. We point out that SVM approach has been developed in VC-theory under risk minimization approach, whereas the regularization approach has been developed under function approximation setting. This distinction is especially important since regularization-based learning is often presented as a purely constructive methodology (with no clearly stated problem setting), even though original regularization theory has been introduced under clearly stated function approximation setting. Further, we present empirical comparisons illustrating the effect of different mechanisms for complexity control (i.e., ε-insensitive loss vs standard ridge regression) on the generalization performance, under very simple settings using synthetic data sets. These comparisons suggest that the SVM approach to complexity control (via ε-loss) is more appropriate for learning under sparse high-dimensional settings.

Original language	English (US)
Title of host publication	Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005
Editors	M.W. Marcellin
Pages	166-171
Number of pages	6
State	Published - 2005
Event	Seventh IASTED International Conference on Signal and Image Processing, SIP 2005 - Honolulu, HI, United States Duration: Aug 15 2005 → Aug 17 2005

Publication series

Name	Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005

Other

Other	Seventh IASTED International Conference on Signal and Image Processing, SIP 2005
Country/Territory	United States
City	Honolulu, HI
Period	8/15/05 → 8/17/05

Keywords

Function approximation
Regularization
Structural risk minimization

OpenUrl availability

Full text

Cite this

Support vector machines and regularization. / Cherkassky, Vladimir; Ma, Yunqian.
Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005. ed. / M.W. Marcellin. 2005. p. 166-171 (Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Cherkassky, V & Ma, Y 2005, Support vector machines and regularization. in MW Marcellin (ed.), Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005. Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005, pp. 166-171, Seventh IASTED International Conference on Signal and Image Processing, SIP 2005, Honolulu, HI, United States, 8/15/05.

@inproceedings{0fd9f687d0f3437ba323853486433902,

title = "Support vector machines and regularization",

abstract = "Recently, there has been a growing interest in Statistical Learning Theory, aka VC theory, due to many successful applications of Support Vector Machines (SVMs). Even though most theoretical results in VC-theory (including all main concepts underlying SVM methodology) have been developed over 25 years ago, these concepts are occasionally misunderstood in the research community. This paper compares standard SVM regression and the regularization for learning dependencies from data. We point out that SVM approach has been developed in VC-theory under risk minimization approach, whereas the regularization approach has been developed under function approximation setting. This distinction is especially important since regularization-based learning is often presented as a purely constructive methodology (with no clearly stated problem setting), even though original regularization theory has been introduced under clearly stated function approximation setting. Further, we present empirical comparisons illustrating the effect of different mechanisms for complexity control (i.e., ε-insensitive loss vs standard ridge regression) on the generalization performance, under very simple settings using synthetic data sets. These comparisons suggest that the SVM approach to complexity control (via ε-loss) is more appropriate for learning under sparse high-dimensional settings.",

keywords = "Function approximation, Regularization, Structural risk minimization",

author = "Vladimir Cherkassky and Yunqian Ma",

year = "2005",

language = "English (US)",

isbn = "0889865183",

series = "Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005",

pages = "166--171",

editor = "M.W. Marcellin",

booktitle = "Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005",

note = "Seventh IASTED International Conference on Signal and Image Processing, SIP 2005 ; Conference date: 15-08-2005 Through 17-08-2005",

}

TY - GEN

T1 - Support vector machines and regularization

AU - Cherkassky, Vladimir

AU - Ma, Yunqian

PY - 2005

Y1 - 2005

N2 - Recently, there has been a growing interest in Statistical Learning Theory, aka VC theory, due to many successful applications of Support Vector Machines (SVMs). Even though most theoretical results in VC-theory (including all main concepts underlying SVM methodology) have been developed over 25 years ago, these concepts are occasionally misunderstood in the research community. This paper compares standard SVM regression and the regularization for learning dependencies from data. We point out that SVM approach has been developed in VC-theory under risk minimization approach, whereas the regularization approach has been developed under function approximation setting. This distinction is especially important since regularization-based learning is often presented as a purely constructive methodology (with no clearly stated problem setting), even though original regularization theory has been introduced under clearly stated function approximation setting. Further, we present empirical comparisons illustrating the effect of different mechanisms for complexity control (i.e., ε-insensitive loss vs standard ridge regression) on the generalization performance, under very simple settings using synthetic data sets. These comparisons suggest that the SVM approach to complexity control (via ε-loss) is more appropriate for learning under sparse high-dimensional settings.

AB - Recently, there has been a growing interest in Statistical Learning Theory, aka VC theory, due to many successful applications of Support Vector Machines (SVMs). Even though most theoretical results in VC-theory (including all main concepts underlying SVM methodology) have been developed over 25 years ago, these concepts are occasionally misunderstood in the research community. This paper compares standard SVM regression and the regularization for learning dependencies from data. We point out that SVM approach has been developed in VC-theory under risk minimization approach, whereas the regularization approach has been developed under function approximation setting. This distinction is especially important since regularization-based learning is often presented as a purely constructive methodology (with no clearly stated problem setting), even though original regularization theory has been introduced under clearly stated function approximation setting. Further, we present empirical comparisons illustrating the effect of different mechanisms for complexity control (i.e., ε-insensitive loss vs standard ridge regression) on the generalization performance, under very simple settings using synthetic data sets. These comparisons suggest that the SVM approach to complexity control (via ε-loss) is more appropriate for learning under sparse high-dimensional settings.

KW - Function approximation

KW - Regularization

KW - Structural risk minimization

UR - http://www.scopus.com/inward/record.url?scp=33644511750&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33644511750&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:33644511750

SN - 0889865183

T3 - Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005

SP - 166

EP - 171

BT - Proceedings of the Seventh IASTED International Conference on Signal and Image Processing, SIP 2005

A2 - Marcellin, M.W.

T2 - Seventh IASTED International Conference on Signal and Image Processing, SIP 2005

Y2 - 15 August 2005 through 17 August 2005

ER -

Support vector machines and regularization

Abstract

Publication series

Other

Keywords

OpenUrl availability

Other files and links

Fingerprint

Cite this