Heart Sound Classification with MFCCs and Wavelet Daubechies Analysis Using Machine Learning Algorithms.
Sebastian Guzman-Alfaro, Karen E Villagrana-Bañuelos, Manuel A Soto-Murillo, Jorge Isaac Galván-Tejada, Antonio Baltazar-Raigosa, Angel Garcia-Duran, José María Celaya-Padilla, Andrea Acuña-Correa
Abstract
Open AccessBackground/Objectives: Cardiovascular diseases are the leading cause of mortality worldwide according to the World Health Organization (WHO), highlighting the need for accessible tools for early detection. Automated classification systems based on signal processing and machine learning offer a non-invasive alternative to support clinical diagnosis. Methods: This study implements and evaluates machine learning models for distinguishing normal and abnormal heart sounds using a hybrid feature extraction approach. Recordings labeled as normal, murmur, and extrasystolic were obtained from the PASCAL dataset and subsequently binarized into two classes. Multiple numerical datasets were generated through statistical features derived from Mel-Frequency Cepstral Coefficients (MFCCs) and Daubechies wavelet analysis. Each dataset was standardized and used to train four classifiers: support vector machines, logistic regression, random forests, and decision trees. Results: Model performance was assessed using accuracy, precision, recall, specificity, F1-score, and area under curve. All classifiers achieved notable results; however, the support vector machine model trained with 26 MFCCs and Daubechies-4 wavelet coefficients obtained the best performance. Conclusions: These findings demonstrate that the proposed hybrid MFCC-Wavelet framework provides competitive diagnostic accuracy and represents a lightweight, interpretable, and computationally efficient solution for computer-aided auscultation and early cardiovascular screening.