Practical consideration on generalization property of natural gradient learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Natural gradient, learning is known to resolve the plateau problem, which is the main cause of slow learning speed of neural nct- works. The adaptive natural gradicn t learning, which is an adaptive method of realizing the natural gradicn tlearning for neural networks, has also been developed and its practical advantage1 has been confirmed. In this paper, w e consider the generalization propert yof the natural gradicn t. method. Theoretically the standard gradient method and the natural gradicn t met hod has the same minimum in the error surface, thus the generalization performance should also be the same. However, in the practical sense, it is feasible that the natural gradicn tmethod gives smaller training error when the standard method stops learning in a plateau. In this case, the solutions that are practically obtained are different from each other, and their generalization performances also come to be different. Since these situations are very often in practical problems, it is necessary to compare the generalization property of the natural gradient learning method with the standard method. In this paper, we show a case that the practical generalization performance of the natural gradient learning is poorer than the standard gradient method, and try to solve the problem by including a rcgularization term in the natural gradient learning.

Original languageEnglish
Title of host publicationConnectionist Models of Neurons, Learning Processes, and Artificial Intelligence - 6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001, Proceedings
PublisherSpringer Verlag
Pages402-409
Number of pages8
EditionPART 1
ISBN (Print)3540422358, 9783540422358
DOIs
StatePublished - 2001
Event6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001 - Granada, Spain
Duration: 13 Jun 200115 Jun 2001

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume2084 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001
Country/TerritorySpain
CityGranada
Period13/06/0115/06/01

Fingerprint

Dive into the research topics of 'Practical consideration on generalization property of natural gradient learning'. Together they form a unique fingerprint.

Cite this