Kernel Hierarchical Agglomerative Clustering - Comparison of Different Gap Statistics to Estimate the Number of Clusters

Na Li; Nicolas Lefèbvre; Régis Lengellé

doi:10.5220/0004828202550262

Communication Dans Un Congrès Année : 2014

Kernel Hierarchical Agglomerative Clustering - Comparison of Different Gap Statistics to Estimate the Number of Clusters

(1) , (1) , (1)

Na Li

Fonction : Auteur
PersonId : 956185

Laboratoire Modélisation et Sûreté des Systèmes

Nicolas Lefèbvre

Fonction : Auteur

Laboratoire Modélisation et Sûreté des Systèmes

Régis Lengellé

Fonction : Auteur
PersonId : 1093755

Laboratoire Modélisation et Sûreté des Systèmes

Résumé

Clustering algorithms, as unsupervised analysis tools, are useful for exploring data structure and have owned great success in many disciplines. For most of the clustering algorithms like k-means, determining the number of the clusters is a crucial step and is one of the most difficult problems. Hierarchical Agglomerative Clustering (HAC) has the advantage of giving a data representation by the dendrogram that allows clustering by cutting the dendrogram at some optimal level. In the past years and within the context of HAC, efficient statistics have been proposed to estimate the number of clusters and the Gap Statistic by Tibshirani has shown interesting performances. In this paper, we propose some new Gap Statistics to further improve the determination of the number of clusters. Our works focus on the kernelized version of the widely-used Hierarchical Clustering Algorithm.

Mots clés

Hierarchical Agglomerative Clustering Gap Statistics Kernel Alignment Number of Clusters

Domaines

Informatique [cs]

Jean-Baptiste VU VAN : Connectez-vous pour contacter le contributeur

https://utt.hal.science/hal-02861450

Soumis le : mardi 9 juin 2020-07:57:45

Dernière modification le : vendredi 12 janvier 2024-16:48:20

Dates et versions

hal-02861450 , version 1 (09-06-2020)

Identifiants

HAL Id : hal-02861450 , version 1
DOI : 10.5220/0004828202550262

Citer

Na Li, Nicolas Lefèbvre, Régis Lengellé. Kernel Hierarchical Agglomerative Clustering - Comparison of Different Gap Statistics to Estimate the Number of Clusters. International Conference on Pattern Recognition Applications and Methods, Mar 2014, Angers, France. pp.255-262, ⟨10.5220/0004828202550262⟩. ⟨hal-02861450⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UTT UTT-LIST3N LM2S-UTT

14 Consultations

0 Téléchargements

Kernel Hierarchical Agglomerative Clustering - Comparison of Different Gap Statistics to Estimate the Number of Clusters

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager