Generalization error and training error at singularities of multilayer perceptions

Shim Ichi Amari, Toinoko Ozeki, Hyeyoung Park

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The neuromanifold or the parameter space of multilayer per- ceptrons includes complex singularities at which the Fisher information matrix degenerates. The parameters are unidentifiable at singularities, and t his causes serious difficulties in learning, known as plateaus in the cost function. The natural or adaptive natural gradient method is proposed for overcoming this difficulty. It. is important to study the relation bet w een t he generalization error and and the t raining error at t he singularities. because t he generalization error is estimated in terms of the training error. The generalization error is studied both for the maximum likelihood estimator (mle) and the Baycsian predictive distribution est imator in terms of the Gaussian random field, by using a simple model. This elucidates the strange behaviors of learning dynamics around singularities.

Original languageEnglish
Title of host publicationConnectionist Models of Neurons, Learning Processes, and Artificial Intelligence - 6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001, Proceedings
PublisherSpringer Verlag
Pages325-332
Number of pages8
EditionPART 1
ISBN (Print)3540422358, 9783540422358
DOIs
StatePublished - 2001
Event6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001 - Granada, Spain
Duration: 13 Jun 200115 Jun 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume2084 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001
Country/TerritorySpain
CityGranada
Period13/06/0115/06/01

Fingerprint

Dive into the research topics of 'Generalization error and training error at singularities of multilayer perceptions'. Together they form a unique fingerprint.

Cite this