Linear KernelPCA and K-Means Clustering Using New Estimated Eigenvectors of the Sample Covariance Matrix

Nassara Elhadji Ille Gado; Edith Grall-Maës; Malika Kharouf

doi:10.1109/ICMLA.2015.207

Conference paper, Year: 2015

Nassara Elhadji Ille Gado

Role: Author

Laboratoire Modélisation et Sûreté des Systèmes

Edith Grall-Maës

Role: Author

Laboratoire Modélisation et Sûreté des Systèmes

Malika Kharouf

Role: Author
PersonId : 1107976

Laboratoire Modélisation et Sûreté des Systèmes

Abstract

In this article, random matrix theory is used to propose a new K-means clustering algorithm via linear PCA. Our approach addresses linear PCA estimation in the regime where the number of features d and the number of samples n go to infinity at the same rate. More precisely, we deal with the problem of building a consistent estimator of the eigenvectors of the data covariance matrix. Numerical results, based on the normalized mutual information (NMI) and the final error rate (ER), are provided and support our algorithm, even for a small number of features/samples. We also compare our approach to spectral clustering, K-means and traditional PCA methods.
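The abstract names NMI and ER as evaluation criteria but does not define them on this page. The following is a minimal sketch under common textbook definitions, which are assumptions and not necessarily the paper's own formulas: NMI is normalized by the arithmetic mean of the two label entropies, and ER is the misclassification fraction under the best one-to-one relabeling of clusters. The function names are hypothetical, and the brute-force matching is only practical for a small number of clusters.

```python
import math
from collections import Counter
from itertools import permutations


def entropy(labels):
    """Shannon entropy (in nats) of a label sequence."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def normalized_mutual_information(truth, pred):
    """I(truth; pred) divided by the mean of the two entropies.

    The arithmetic-mean normalization is an assumption; other
    conventions (geometric mean, max, min) are also in use.
    """
    n = len(truth)
    joint = Counter(zip(truth, pred))
    pt, pp = Counter(truth), Counter(pred)
    mi = sum(
        (c / n) * math.log(c * n / (pt[t] * pp[p]))
        for (t, p), c in joint.items()
    )
    denom = 0.5 * (entropy(truth) + entropy(pred))
    # Both labelings constant: treat them as trivially identical.
    return mi / denom if denom > 0 else 1.0


def error_rate(truth, pred):
    """Fraction of misassigned samples under the best cluster relabeling."""
    classes = sorted(set(truth))
    clusters = sorted(set(pred))
    # Pad with None so every cluster can be mapped when counts differ.
    classes += [None] * max(0, len(clusters) - len(classes))
    best_hits = 0
    # Brute force over injective cluster-to-class maps (small k only).
    for perm in permutations(classes, len(clusters)):
        mapping = dict(zip(clusters, perm))
        hits = sum(mapping[p] == t for t, p in zip(truth, pred))
        best_hits = max(best_hits, hits)
    return 1.0 - best_hits / len(truth)


if __name__ == "__main__":
    truth = [0, 0, 1, 1, 2, 2]
    pred = [1, 1, 0, 2, 2, 2]  # cluster ids are arbitrary
    print(normalized_mutual_information(truth, pred))
    print(error_rate(truth, pred))
```

Both scores are invariant to how the clusters are numbered, which is why they suit comparisons between K-means, spectral clustering and PCA-based variants whose cluster ids carry no meaning.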

Domains

Machine Learning [cs.LG]

https://utt.hal.science/hal-02330752

Submitted on: Thursday, 24 October 2019, 10:35:43

Last modified on: Wednesday, 24 April 2024, 17:44:31

Dates and versions

hal-02330752 , version 1 (24-10-2019)

Identifiers

HAL Id : hal-02330752 , version 1
DOI : 10.1109/ICMLA.2015.207

Cite

Nassara Elhadji Ille Gado, Edith Grall-Maës, Malika Kharouf. Linear KernelPCA and K-Means Clustering Using New Estimated Eigenvectors of the Sample Covariance Matrix. 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA), Dec 2015, Miami, United States. pp.386-389, ⟨10.1109/ICMLA.2015.207⟩. ⟨hal-02330752⟩

Collections

CNRS UTT UTT-LIST3N LM2S-UTT
