SVM-KM: Speeding SVMs Learning with a priori Cluster Selection and k-Means

**
Marcelo Barros de Almeida,
Antônio de Pádua Braga,
João Pedro Braga.
**

A procedure called SVM-KM, based on clustering by k-means and to accelerate the training of Support Vector Machine, is the main objective of the present work. During the Support Vector Machines (SVMs) optimization phase, training vectors near the separation margins are likely to become support vector and must be preserved. Conversely, training vectors far from the margins are not in general taken into account for SVM's design process. SVM-KM groups the training vector in many clusters. Clusters formed only by a vector that belongs to the same class label can be disregard and only cluster centers are used. On the other hand, clusters with more than one class label are unchanged and all training vectors belonging to them are considered. Cluster with mixed composition are likely to happen near the separation margins and they may hold some support vectors. Consequently, the number of vectors in a SVM training is smaller and the training time can be decreased without compromising the generalization capability of the SVM.

http://csdl.computer.org/comp/proceedings/sbrn/2000/0856/00/08560162abs.htm

Biblioteca Digital Brasileira de Computação - Contato: bdbcomp@lbd.dcc.ufmg.br