Abstract
This paper proposes maximum likelihood-based automatic lexicon generation using mixed syllables for an unlimited-vocabulary voice interface for East Asian languages (e.g., Korean, Chinese, and Japanese) in AI assistant-based interaction with mobile devices. Conventional lexicons have two inevitable problems: 1) the tedious, repeated addition of out-of-lexicon units to the lexicon, and 2) the propagation of errors during morpheme analysis and space segmentation. The proposed method provides an automatic framework that addresses both problems. With one out-of-lexicon word in a sentence, its overall accuracy is similar to that of previous methods; with two, three, and four out-of-lexicon words in a sentence, it yields absolute improvements in word accuracy of 1.62%, 5.58%, and 10.09%, respectively.
| Original language | English |
|---|---|
| Pages (from-to) | 4264-4279 |
| Number of pages | 16 |
| Journal | KSII Transactions on Internet and Information Systems |
| Volume | 11 |
| Issue number | 9 |
| State | Published - 30 Sep 2017 |
Keywords
- Automatic lexicon generation
- Intelligent personal assistant (IPA)
- Maximum likelihood
- Out-of-lexicon (OOL)
- Speech recognition
Fingerprint
Dive into the research topics of 'Maximum likelihood-based automatic lexicon generation for AI assistant-based interaction with mobile devices'. Together they form a unique fingerprint.