Abstract
We explore a novel approach to upper bound the misclassification error for problems with data comprising a small number of positive samples and a large number of negative samples. We assign the hinge-loss to upper bound the misclassification error of the positive examples and use the minimax risk to upper bound the misclassification error with respect to the worst case distribution that generates the negative examples. This approach is computationally appealing since the majority of training examples (belonging to the negative class) are represented by the statistics of their distribution, in contrast to kernel SVM, which produces a very large number of support vectors in such settings. We derive empirical risk bounds for linear and non-linear classification and show that they are dimensionally independent and decay as 1/√m for m samples. We propose an efficient algorithm for training an intersection of a finite number of hyperplanes and demonstrate its effectiveness on real data, including letter and scene recognition.
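The two bounds named in the abstract can be illustrated for a single linear classifier. The sketch below is not the paper's algorithm: it assumes the negative class is summarized only by a mean `mu` and covariance `Sigma`, and it uses the classical one-sided Chebyshev (Cantelli) bound as the worst-case error over all distributions with those moments. The function names and the toy values are hypothetical, chosen only for illustration.

```python
import numpy as np


def hinge_loss(w, b, X_pos):
    """Mean hinge loss over positive examples (label +1)."""
    margins = X_pos @ w + b
    return float(np.mean(np.maximum(0.0, 1.0 - margins)))


def worst_case_negative_error(w, b, mu, Sigma):
    """Upper bound on P(w.x + b >= 0) over every distribution with
    mean mu and covariance Sigma (one-sided Chebyshev bound)."""
    # How far the negative-class mean lies on the negative side of the boundary.
    margin = -(float(w @ mu) + b)
    if margin <= 0.0:
        # The mean itself is misclassified, so the bound is vacuous.
        return 1.0
    var = float(w @ Sigma @ w)  # variance of the projection w.x
    return var / (var + margin ** 2)


# Toy values (assumed, not taken from the paper).
w = np.array([1.0, 0.0])
b = 0.0
X_pos = np.array([[2.0, 0.0], [0.5, 1.0]])  # a few positive samples
mu = np.array([-2.0, 0.0])                  # negative-class mean
Sigma = np.eye(2)                           # negative-class covariance

pos_loss = hinge_loss(w, b, X_pos)
neg_bound = worst_case_negative_error(w, b, mu, Sigma)
print(pos_loss, neg_bound)
```

Note that the negative class enters only through `mu` and `Sigma`, so the cost of evaluating its term does not grow with the number of negative samples, which is the computational appeal the abstract describes.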
Original language | English |
---|---|
Title of host publication | 32nd International Conference on Machine Learning, ICML 2015 |
Editors | David Blei, Francis Bach |
Publisher | International Machine Learning Society (IMLS) |
Pages | 1558-1566 |
Number of pages | 9 |
ISBN (Electronic) | 9781510810587 |
State | Published - 2015 |
Event | 32nd International Conference on Machine Learning, ICML 2015 - Lille, France. Duration: 6 Jul 2015 → 11 Jul 2015 |
Publication series
Name | 32nd International Conference on Machine Learning, ICML 2015 |
---|---|
Volume | 2 |
Conference
Conference | 32nd International Conference on Machine Learning, ICML 2015 |
---|---|
Country/Territory | France |
City | Lille |
Period | 6/07/15 → 11/07/15 |
Bibliographical note
Publisher Copyright: Copyright © 2015 by the author(s).
ASJC Scopus subject areas
- Human-Computer Interaction
- Computer Science Applications