A method for enhancing speech and warning signals based on parallel convolutional neural networks in a noisy environment

Ha Lim Kang, Sung Dae Na, Myoung Nam Kim

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

BACKGROUND: Digital hearing aids are based on technology that amplifies sound and removes noise according to the frequency of hearing loss in hearing loss patients. However, within the noise removed is a warning sound that alert the listener; the listener may be exposed to danger because the warning sound is not recognized. OBJECTIVE: In this paper, a deep learning model was used to improve these limits and propose a method to distinguish the warning sound in speech signals mixed with noise. In addition, the improved speech and warning sound were derived by removing noise present in the classification sound signals. METHODS: To classify the sound dataset, an adaptive convolution filter that changes according to two signals is proposed. The proposed convolution filter is applied to the PCNNs model to analyze the characteristics of the time and frequency domains of the dataset and classify the presence or absence of warning sound. In addition, the CEDN model was used to improve the intelligibility of the warning and the speech in the signal based on the warning sound classification from the proposed PCNNs model. RESULTS: Experimental results show that the PCNNs model using the proposed multiplicative filters is efficient for analyzing sound signals with complex frequencies. In addition, the CEDN model was used to improve the intelligibility of the warning and the speech in the signal based on the warning sound classification from the proposed PCNNs model. CONVLUSION: We confirmed that the PCNN model with the proposed filter showed the highest training rate, lowest error rate, and the most stable results. In addition, the CEDN model confirmed that speech and warning sounds were recognized, but it was confirmed that there was a limitation in clearly recognizing speech as the noise ratio increased.

Original languageEnglish
Pages (from-to)S141-S152
JournalTechnology and Health Care
Volume29
Issue numberS1
DOIs
StatePublished - 2021

Keywords

  • convolutional encoder-decoder network
  • parallel convolutional neural networks
  • sound enhancement
  • Warning sound classification

Fingerprint

Dive into the research topics of 'A method for enhancing speech and warning signals based on parallel convolutional neural networks in a noisy environment'. Together they form a unique fingerprint.

Cite this