Δημοσίευση - Speaker Clustering via the mean shift algorithm
ΤΑΥΤΟΤΗΤΑ

Speaker Clustering via the mean shift algorithm

Ερευνητική περιοχή:  
    
Είδος:  
Άρθρο σε πρακτικά

 

Έτος: 2010
Συγγραφείς: Θέμος Σταφυλάκης; Βασίλης Κατσούρος; Γεώργιος Καραγιάννης
Τίτλος βιβλίου: Proceedings of the 2010 Speaker and Language Recognition Workshop (Speaker Odyssey)
Σελίδες: 186 - 193
Διεύθυνση: Brno, Czech Republic
Οργανισμός: ISCA
Περίληψη:
In this paper, we investigate the use of the mean shift algorithm with respect to speaker clustering. The algorithm is an elegant nonparametric technique that has become very popular in image segmentation, video tracking and other image processing and computer vision tasks. Its primary aim is to detect the modes of the underlying density and consequently merge those observa- tions being attracted by each mode. Since the number of modes is not needed to be known beforehand, the algorithm seems to fit well to the problem of speaker clustering. However, the al- gorithm needs to be adapted; the original algorithm acts on the space of observations, while speaker clustering algorithms act on the space of probabilistic parametric models. We attempt to adapt the algorithm, based on some basic concepts of infor- mation geometry, that are related to the exponential family of distributions.
[Bibtex]