Implementation of Dysarthria Identification Using MFCC and Multilayer Perceptron Algorithm

Abdul Fadlil; Latief Perdana; Ardi Pujiyanta

doi:10.14445/23488379/IJEEE-V12I1P105

Implementation of Dysarthria Identification Using MFCC and Multilayer Perceptron Algorithm

International Journal of Electrical and Electronics Engineering

Volume 12 Issue 1

Year of Publication : 2025

Authors : Abdul Fadlil, Latief Perdana, Ardi Pujiyanta, Herman, Haris Imam Karim Fathurrahman, Maulana Muhammad Jogo Samodro

10.14445/23488379/IJEEE-V12I1P105

How to Cite?

Abdul Fadlil, Latief Perdana, Ardi Pujiyanta, Herman, Haris Imam Karim Fathurrahman, Maulana Muhammad Jogo Samodro, "Implementation of Dysarthria Identification Using MFCC and Multilayer Perceptron Algorithm," SSRG International Journal of Electrical and Electronics Engineering, vol. 12, no. 1, pp. 32-46, 2025. Crossref, https://doi.org/10.14445/23488379/IJEEE-V12I1P105

Abstract:

Dysarthria is an inability of the child’s muscles to pronounce certain vocabulary. One of the words that is often difficult to pronounce is the R sound. Therefore, it is important to identify R sound dysarthria as a preventive measure and can be used as a therapeutic reference. The study uses the phrase “laler menclok pager” as the basis for picking up voice data in children. In that sentence, there is a letter R that will be processed later. The processing method used is MFCC. The output from the extraction of the MFCC characteristics is inserted as the input material of the Multilayer Perceptron (MLP) artificial intelligence algorithm. The results of this study provide a high degree of accuracy, and the test data can be well identified as a whole. The results also obtained the MLP configuration of 16 input neurons and 8 hidden neurons with the highest accuracy as well as the lightest computing. With this result, further hardware can be developed to integrate the system for identifying dysarthria.

Keywords:

Dysarthria, MFCC, MLP, Neuron, R sound.

References:

[1] M. Bourqui et al., “The Encoding of Speech Modes in Motor Speech Disorders: Whispered Versus Normal Speech in Apraxia of Speech and Hypokinetic Dysarthria,” Clinical Linguistics and Phonetics, pp. 1-22, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Sherine R. Tambyraja, Kelly Farquharson, and Laura M. Justice, “Phonological Processing Skills in Children with Speech Sound Disorder: A Multiple Case Study Approach,” International Journal of Language and Communication Disorders, vol. 58, no. 1, pp. 15-27, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Iida Aakko et al., “Auditory-Perceptual Evaluation with Visual Analogue Scale: Feasibility and Preliminary Evidence of Ultrasound Visual Feedback Treatment of Finnish [r],” Clinical Linguistics and Phonetics, vol. 37, no. 4-6, pp. 345-362, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Elizabeth Roepke, “Assessing Phonological Processing in Children with Speech Sound Disorders,” Perspectives of the ASHA Special Interest Groups, vol. 9, no. 1, pp. 1-21, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Hamza Kheddar, Mustapha Hemis, and Yassine Himeur, “Automatic Speech Recognition Using Advanced Deep Learning Approaches: A Survey,” Information Fusion, vol. 109, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Wenyi Yu et al., “Connecting Speech Encoder and Large Language Model for ASR,” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 12637-12641, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Pranav Kumar, Md. Talib Ahmad, and Ranjana Kumari, “HPO Based Enhanced Elman Spike Neural Network for Detecting Speech of People with Dysarthria,” Optical Memory and Neural Networks, vol. 33, no. 2, pp. 205-220, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Kodali Radha, Mohan Bansal, and Venkata Rao Dhulipalla, “Variable STFT Layered CNN Model for Automated Dysarthria Detection and Severity Assessment Using Raw Speech,” Circuits, Systems, and Signal Processing, vol. 43, no. 5, pp. 3261-3278, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[9] C. Zander et al., “Freiburg Neuropathology Case Conference: 68-Year-Old Patient with Slurred Speech, Double Vision, and Increasing Gait Disturbance,” Clinical Neuroradiology, vol. 34, no. 1, pp. 279-286, 2024.
[CrossRef] [Publisher Link]
[10] Huma Nasir, and Muhammad Arslan Zahid, “Chlorpromazine-Induced Neurological Symptoms Mimicking Stroke in an Elderly Patient with Intractable Hiccups: A Case Report,” Journal of Health and Rehabilitation Research, vol. 4, no. 1, pp. 995-999, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Malik Saad, Quratulain Maha, and Muhammad Talal, “Disseminated Salmonella Typhi Infection Presenting with Slurred Speech and Encephalopathy: An Unusual Presentation,” National Journal of Health Sciences., vol. 9, no. 2, pp. 131-136, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Alan Wayne Jones, “Dubowski’s Stages of Alcohol Influence and Clinical Signs and Symptoms of Drunkenness in Relation to A Person’s Blood-Alcohol Concentration-Historical Background,” Journal of Analytical Toxicology, vol. 48, no. 3, pp. 131-140, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Panduranga Vital Terlapu, and R. Prasad Reddy Sadi, “Real-time Speech-based Intoxication Detection System: Vowel Biomarker Analysis with Artificial Neural Networks,” International Journal of Computing and Digital Systems, vol. 15, no. 1, pp. 1637-1666, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Ran Zhou et al., “MFCC Based Real-Time Speech Reproduction and Recognition Using Distributed Acoustic Sensing Technology,” Optoelectronics Letters, vol. 20, no. 4, pp. 222-227, Apr. 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Manjit Singh Sidhu, Nur Atiqah Abdul Latib, and Kirandeep Kaur Sidhu, “MFCC In Audio Signal Processing for Voice Disorder: A Review,” Multimedia Tools and Applications, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Siba Prasad Mishra, Pankaj Warule, and Suman Deb, “Speech Emotion Recognition Using MFCC-Based Entropy Feature,” Signal, Image Video Processing, vol. 18, no. 1, pp. 153-161, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Nur Aishah Zainal et al., “Integration of MFCCs and CNN for Multi-Class Stress Speech Classification on Unscripted Dataset,” IIUM Engineering Journal, vol. 25, no. 2, pp. 381-395, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Wittaya Jitchaijaroen et al., “Machine Learning Approaches for Stability Prediction of Rectangular Tunnels in Natural Clays Based on MLP and RBF Neural Networks,” Intelligent Systems with Applications, vol. 21, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Nikhil B. Gaikwad et al., “Hardware Design and Implementation of Multiagent MLP Regression for the Estimation of Gunshot Direction on IoBT Edge Gateway,” IEEE Sensors Journal, vol. 23, no. 13, pp. 14549-14557, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Amjad Alsirhani et al., “A Novel Approach to Predicting the Stability of The Smart Grid Utilizing MLP-ELM Technique,” Alexandria Engineering Journal, vol. 74, pp. 495-508, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Qiang Gao et al., “Electroencephalogram Signal Classification Based on Fourier Transform and Pattern Recognition Network for Epilepsy Diagnosis,” Engineering Applications of Artificial Intelligence, vol. 123, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[22] J. Naskath, G. Sivakamasundari, and A. Alif Siddiqua Begum, “A Study on Different Deep Learning Algorithms Used in Deep Neural Nets: MLP SOM and DBN,” Wireless Personal Communications, vol. 128, no. 4, pp. 2913-2936, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Ahmad Abbaskhah, Hamed Sedighi, and Hossein Marvi, “Infant Cry Classification by MFCC Feature Extraction with MLP and CNN Structures,” Biomedical Signal Processing and Control, vol. 86, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Yuanyuan Wei et al., “AE-MLP: A Hybrid Deep Learning Approach for DDoS Detection and Classification,” IEEE Access, vol. 9, pp. 146810-146821, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Joomee Song et al., “Detection and Differentiation of Ataxic and Hypokinetic Dysarthria in Cerebellar Ataxia and Parkinsonian Disorders Via Wave Splitting and Integrating Neural Networks,” PLoS One, vol. 17, no. 6, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Gennaro Tartarisco et al., “Artificial Intelligence for Dysarthria Assessment in Children with Ataxia: A Hierarchical Approach,” IEEE Access, vol. 9, pp. 166720-166735, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Adolfo M. García et al., “Cognitive Determinants of Dysarthria in Parkinson’s Disease: An Automated Machine Learning Approach,” Movement disorders, vol. 36, no. 12, pp. 2862-2873, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Mayur R Gamit, and Kinnal Dhameliya, “Isolated Words Recognition Using MFCC, LPC and Neural Network,” International Journal of Research in Engineering and Technology, vol. 4, no. 6, pp. 146-149, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Amit Moondra, and Poonam Chahal, “Improved Speaker Recognition for Degraded Human Voice using Modified-MFCC and LPC with CNN,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 4, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[30] V. Anil Kumar, Ch. V. Rama Rao, and N. Leema, “Audio Source Separation by Estimating the Mixing Matrix in Underdetermined Condition Using Successive Projection and Volume Minimization,” International Journal of Information Technology, vol. 15, no. 4, pp. 1831-1844, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[31] Lei Fan et al., “Accurate Frequency Estimator of Sinusoid Based on Interpolation of FFT and DTFT,” IEEE Access, vol. 8, pp. 44373-44380, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[32] Sumam Sebastian, “Performance Evaluation by Artificial Neural Network Using WEKA,” International Research Journal of Engineering and Technology (IRJET), vol. 3, no. 3, pp. 1459-1464, 2016.
[Google Scholar] [Publisher Link]
[33] Dong-Her Shih et al., “Dysarthria Speech Detection Using Convolutional Neural Networks with Gated Recurrent Unit,” Healthcare, vol. 10, no. 10, pp. 1-14, 2022.
[CrossRef] [Google Scholar] [Publisher Link]

IJEEE MENUS

Call for Paper - Upcoming Issues

Implementation of Dysarthria Identification Using MFCC and Multilayer Perceptron Algorithm

How to Cite?

Abstract:

Keywords:

References: