Header logo is

Discriminative K-means for Clustering

2008

Conference Paper

ei


We present a theoretical study on the discriminative clustering framework, recently proposed for simultaneous subspace selection via linear discriminant analysis (LDA) and clustering. Empirical results have shown its favorable performance in comparison with several other popular clustering algorithms. However, the inherent relationship between subspace selection and clustering in this framework is not well understood, due to the iterative nature of the algorithm. We show in this paper that this iterative subspace selection and clustering is equivalent to kernel K-means with a specific kernel Gram matrix. This provides significant and new insights into the nature of this subspace selection procedure. Based on this equivalence relationship, we propose the Discriminative K-means (DisKmeans) algorithm for simultaneous LDA subspace selection and clustering, as well as an automatic parameter estimation procedure. We also present the nonlinear extension of DisKmeans using kernels. We show that the learning of the ke rnel matrix over a convex set of pre-specified kernel matrices can be incorporated into the clustering formulation. The connection between DisKmeans and several other clustering algorithms is also analyzed. The presented theories and algorithms are evaluated through experiments on a collection of benchmark data sets.

Author(s): Ye, J. and Zhao, Z. and Wu, M.
Book Title: Advances in neural information processing systems 20
Journal: Advances in Neural Information Processing Systems 20: 21st Annual Conference on Neural Information Processing Systems 2007
Pages: 1649-1656
Year: 2008
Month: September
Day: 0
Editors: Platt, J. C., D. Koller, Y. Singer, S. Roweis
Publisher: Curran

Department(s): Empirical Inference
Bibtex Type: Conference Paper (inproceedings)

Event Name: Twenty-First Annual Conference on Neural Information Processing Systems (NIPS 2007)
Event Place: Vancouver, BC, Canada

Address: Red Hook, NY, USA
Digital: 0
ISBN: 978-1-605-60352-0
Language: en
Organization: Max-Planck-Gesellschaft
School: Biologische Kybernetik

Links: PDF
Web

BibTex

@inproceedings{4710,
  title = {Discriminative K-means for Clustering},
  author = {Ye, J. and Zhao, Z. and Wu, M.},
  journal = {Advances in Neural Information Processing Systems 20: 21st Annual Conference on Neural Information Processing Systems 2007},
  booktitle = {Advances in neural information processing systems 20},
  pages = {1649-1656},
  editors = {Platt, J. C., D. Koller, Y. Singer, S. Roweis},
  publisher = {Curran},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  address = {Red Hook, NY, USA},
  month = sep,
  year = {2008},
  doi = {},
  month_numeric = {9}
}