Singularity and slow convergence of the em algorithm for gaussian mixtures

Hyeyoung Park, Tomoko Ozeki

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

Singularities in the parameter spaces of hierarchical learning machines are known to be a main cause of slow convergence of gradient descent learning. The EM algorithm, which is another learning algorithm giving a maximum likelihood estimator, is also suffering from its slow convergence, which often appears when the component overlap is large. We analyze the dynamics of the EM algorithm for Gaussian mixtures around singularities and show that there exists a slow manifold caused by a singular structure, which is closely related to the slow convergence of the EM algorithm. We also conduct numerical simulations to confirm the theoretical analysis. Through the simulations, we compare the dynamics of the EM algorithm with that of the gradient descent algorithm, and show that their slow dynamics are caused by the same singular structure, and thus they have the same behaviors around singularities.

Original languageEnglish
Pages (from-to)45-59
Number of pages15
JournalNeural Processing Letters
Volume29
Issue number1
DOIs
StatePublished - Feb 2009

Keywords

  • EM algorithm
  • Gradient descent learning
  • Learning dynamics
  • Singularity
  • Slow convergence

Fingerprint

Dive into the research topics of 'Singularity and slow convergence of the em algorithm for gaussian mixtures'. Together they form a unique fingerprint.

Cite this