Phoneme-Based Persian Speech Recognition

Reading time: 3 minute
...

📝 Original Info

  • Title: Phoneme-Based Persian Speech Recognition
  • ArXiv ID: 1901.04699
  • Date: 2019-01-15
  • Authors: Saber Malekzadeh

📝 Abstract

Undoubtedly, one of the most important issues in computer science is intelligent speech recognition. In these systems, computers try to detect and respond to the speeches they are listening to, like humans. In this research, presenting of a suitable method for the diagnosis of Persian phonemes by AI using the signal processing and classification algorithms have tried. For this purpose, the STFT algorithm has been used to process the audio signals, as well as to detect and classify the signals processed by the deep artificial neural network. At first, educational samples were provided as two phonological phrases in Persian language and then signal processing operations were performed on them. Then the results for the data training have been given to the artificial deep neural network. At the final stage, the experiment was conducted on new sounds.

💡 Deep Analysis

Deep Dive into Phoneme-Based Persian Speech Recognition.

Undoubtedly, one of the most important issues in computer science is intelligent speech recognition. In these systems, computers try to detect and respond to the speeches they are listening to, like humans. In this research, presenting of a suitable method for the diagnosis of Persian phonemes by AI using the signal processing and classification algorithms have tried. For this purpose, the STFT algorithm has been used to process the audio signals, as well as to detect and classify the signals processed by the deep artificial neural network. At first, educational samples were provided as two phonological phrases in Persian language and then signal processing operations were performed on them. Then the results for the data training have been given to the artificial deep neural network. At the final stage, the experiment was conducted on new sounds.

📄 Full Content

54 (4): -565 . doi:1001016/j.specom. 20110110004

..

Boashash, Boualem(2015). Time-frequency signal analysis and processing: a comprehensive reference. Academic Press.. Graves, Alex, Abdel-rahman Mohamed, and Geoffrey Hinton.( 2013) “Speech recognition with deep recurrent neural networks.” Acoustics, speech and signal processing (icassp), 2013 ieee international conference on. IEEE..

LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. ( 2015 Rumelhart, David E., Geoffrey E. Hinton.(1995) “Learning Internal Representations by Error Propagation”. David E. Rumelhart, James L. McClelland, and the PDP research group. (editors), Parallel distributed processing: Explorations in the mic.

[ 30 ] MacNeilage, Peter, (2008). The Origin of Speech. Oxford: Oxford University Press..

Gupta, Vishwa N., et al.(1995)

…………………………. ………………………….. …………. …………………………. ………………………….. ………………………….. ……………. …………………………. ………………………….. …………….. BIDE ‫و‬ BOWED

BBN ‫شرکت‬ ، IBM ‫شگاه‬ ‫دان‬ ،

2 Juang, B. H.; Rabiner, Lawrence R.(2015) “Automatic speech recognition-a brief history of the technology development” (PDF): 6. Retrieved 17 January.. [ 8 ] Pierce, John R. (1969). “Whither speech recognition “?Journal of the Acoustical Society of America. 46 (48): 1049. doi

2 Juang, B. H.; Rabiner, Lawrence R.(2015

2 Juang, B. H.; Rabiner, Lawrence R.(

Donald O. Hebb

n BatchNormalizatio

📸 Image Gallery

cover.png

Reference

This content is AI-processed based on ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut