Loading...
Please wait, while we are loading the content...
Similar Documents
Semi-supervised graph clustering: a kernel approach (2005)
| Content Provider | CiteSeerX |
|---|---|
| Author | Kulis, Brian Basu, Sugato Dhillon, Inderjit Mooney, Raymond |
| Description | Semi-supervised clustering algorithms aim to improve clustering results using limited supervision. The supervision is generally given as pairwise constraints; such constraints are natural for graphs, yet most semisupervised clustering algorithms are designed for data represented as vectors. In this paper, we unify vector-based and graph-based approaches. We show that a recently-proposed objective function for semi-supervised clustering based on Hidden Markov Random Fields, with squared Euclidean distance and a certain class of constraint penalty functions, can be expressed as a special case of the weighted kernel k-means objective. A recent theoretical connection between kernel kmeans and several graph clustering objectives enables us to perform semi-supervised clustering of data given either as vectors or as a graph. For vector data, the kernel approach also enables us to find clusters with nonlinear boundaries in the input data space. Furthermore, we show that recent work on spectral learning (Kamvar et al., 2003) may be viewed as a special case of our formulation. We empirically show that our algorithm is able to outperform current state-of-the-art semi-supervised algorithms on both vectorbased and graph-based data sets. 1. In ICML ’05: Proceedings of the 22nd international conference on Machine learning |
| File Format | |
| Language | English |
| Publisher | ACM Press |
| Publisher Date | 2005-01-01 |
| Access Restriction | Open |
| Subject Keyword | Certain Class Spectral Learning Kernel Approach Recent Theoretical Connection Algorithm Aim Kernel Kmeans Squared Euclidean Distance Graph-based Data Set Recent Work Limited Supervision Pairwise Constraint Semi-supervised Clustering Special Case Graph-based Approach Vector Data Semi-supervised Graph Clustering Several Graph Current State-of-the-art Semi-supervised Algorithm Input Data Space Recently-proposed Objective Function Semisupervised Clustering Algorithm Nonlinear Boundary Constraint Penalty Function Weighted Kernel K-means Hidden Markov Random Field |
| Content Type | Text |
| Resource Type | Article |