Adaptive Natural Gradient Method for Learning Neural Networks with Large Data set in Mini-Batch Mode

Hyeyoung Park, Kwanyong Lee

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Natural gradient learning, which is one of gradient descent learning methods, is known to have ideal convergence properties in the learning of hierarchical machines such as layered neural networks. However, there are a few limitations that degrades its practical usability: necessity of true probability density function of input variables and heavy computational cost due to matrix inversion. Though its adaptive approximation have been developed, it is basically derived for online learning mode, in which a single update is done for a single data sample. Noting that the on-line learning mode is not appropriate for the tasks with huge number of training data, this paper proposes a practical implementation of natural gradient for mini-batch learning mode, which is the most common setting in the real application with large data set. Computational experiments on benchmark datasets shows the efficiency of the proposed methods.

Original languageEnglish
Title of host publication1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages306-310
Number of pages5
ISBN (Electronic)9781538678220
DOIs
StatePublished - 18 Mar 2019
Event1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019 - Okinawa, Japan
Duration: 11 Feb 201913 Feb 2019

Publication series

Name1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019

Conference

Conference1st International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2019
Country/TerritoryJapan
CityOkinawa
Period11/02/1913/02/19

Keywords

  • Gradient descent learning
  • Mini-batch learning mode
  • Natural gradient learning
  • Neural networks
  • On-line learning

Fingerprint

Dive into the research topics of 'Adaptive Natural Gradient Method for Learning Neural Networks with Large Data set in Mini-Batch Mode'. Together they form a unique fingerprint.

Cite this