Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function

Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function. Petmezas, G., Cheimariotis, G., Stefanopoulos, L., Rocha, B., Paiva, R. P., Katsaggelos, A. K., & Maglaveras, N. Sensors, 22(3):1232, MDPI, feb, 2022.

Paper doi abstract bibtex

Respiratory diseases constitute one of the leading causes of death worldwide and directly affect the patient's quality of life. Early diagnosis and patient monitoring, which conventionally include lung auscultation, are essential for the efficient management of respiratory diseases. Manual lung sound interpretation is a subjective and time-consuming process that requires high medical expertise. The capabilities that deep learning offers could be exploited in order that robust lung sound classification models can be designed. In this paper, we propose a novel hybrid neural model that implements the focal loss (FL) function to deal with training data imbalance. Features initially extracted from short-time Fourier transform (STFT) spectrograms via a convolutional neural network (CNN) are given as input to a long short-term memory (LSTM) network that memorizes the temporal dependencies between data and classifies four types of lung sounds, including normal, crackles, wheezes, and both crackles and wheezes. The model was trained and tested on the ICBHI 2017 Respiratory Sound Database and achieved state-of-the-art results using three different data splitting strategies—namely, sensitivity 47.37%, specificity 82.46%, score 64.92% and accuracy 73.69% for the official 60/40 split, sensitivity 52.78%, specificity 84.26%, score 68.52% and accuracy 76.39% using interpatient 10-fold cross validation, and sensitivity 60.29% and accuracy 74.57% using leave-one-out cross validation.

@article{Petmezas2022a,
abstract = {Respiratory diseases constitute one of the leading causes of death worldwide and directly affect the patient's quality of life. Early diagnosis and patient monitoring, which conventionally include lung auscultation, are essential for the efficient management of respiratory diseases. Manual lung sound interpretation is a subjective and time-consuming process that requires high medical expertise. The capabilities that deep learning offers could be exploited in order that robust lung sound classification models can be designed. In this paper, we propose a novel hybrid neural model that implements the focal loss (FL) function to deal with training data imbalance. Features initially extracted from short-time Fourier transform (STFT) spectrograms via a convolutional neural network (CNN) are given as input to a long short-term memory (LSTM) network that memorizes the temporal dependencies between data and classifies four types of lung sounds, including normal, crackles, wheezes, and both crackles and wheezes. The model was trained and tested on the ICBHI 2017 Respiratory Sound Database and achieved state-of-the-art results using three different data splitting strategies—namely, sensitivity 47.37%, specificity 82.46%, score 64.92% and accuracy 73.69% for the official 60/40 split, sensitivity 52.78%, specificity 84.26%, score 68.52% and accuracy 76.39% using interpatient 10-fold cross validation, and sensitivity 60.29% and accuracy 74.57% using leave-one-out cross validation.},
author = {Petmezas, Georgios and Cheimariotis, Grigorios-Aris and Stefanopoulos, Leandros and Rocha, Bruno and Paiva, Rui Pedro and Katsaggelos, Aggelos K. and Maglaveras, Nicos},
doi = {10.3390/s22031232},
issn = {1424-8220},
journal = {Sensors},
keywords = {Asthma,CNN,COPD,Crackles,Focal loss,LSTM,Lung sounds,STFT,Wheezes},
month = {feb},
number = {3},
pages = {1232},
pmid = {35161977},
publisher = {MDPI},
title = {{Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function}},
url = {https://www.mdpi.com/1424-8220/22/3/1232},
volume = {22},
year = {2022}
}

Downloads: 0

{"_id":"8ka7aJRf5LfW9kcsQ","bibbaseid":"petmezas-cheimariotis-stefanopoulos-rocha-paiva-katsaggelos-maglaveras-automatedlungsoundclassificationusingahybridcnnlstmnetworkandfocallossfunction-2022","author_short":["Petmezas, G.","Cheimariotis, G.","Stefanopoulos, L.","Rocha, B.","Paiva, R. P.","Katsaggelos, A. K.","Maglaveras, N."],"bibdata":{"bibtype":"article","type":"article","abstract":"Respiratory diseases constitute one of the leading causes of death worldwide and directly affect the patient's quality of life. Early diagnosis and patient monitoring, which conventionally include lung auscultation, are essential for the efficient management of respiratory diseases. Manual lung sound interpretation is a subjective and time-consuming process that requires high medical expertise. The capabilities that deep learning offers could be exploited in order that robust lung sound classification models can be designed. In this paper, we propose a novel hybrid neural model that implements the focal loss (FL) function to deal with training data imbalance. Features initially extracted from short-time Fourier transform (STFT) spectrograms via a convolutional neural network (CNN) are given as input to a long short-term memory (LSTM) network that memorizes the temporal dependencies between data and classifies four types of lung sounds, including normal, crackles, wheezes, and both crackles and wheezes. The model was trained and tested on the ICBHI 2017 Respiratory Sound Database and achieved state-of-the-art results using three different data splitting strategies—namely, sensitivity 47.37%, specificity 82.46%, score 64.92% and accuracy 73.69% for the official 60/40 split, sensitivity 52.78%, specificity 84.26%, score 68.52% and accuracy 76.39% using interpatient 10-fold cross validation, and sensitivity 60.29% and accuracy 74.57% using leave-one-out cross validation.","author":[{"propositions":[],"lastnames":["Petmezas"],"firstnames":["Georgios"],"suffixes":[]},{"propositions":[],"lastnames":["Cheimariotis"],"firstnames":["Grigorios-Aris"],"suffixes":[]},{"propositions":[],"lastnames":["Stefanopoulos"],"firstnames":["Leandros"],"suffixes":[]},{"propositions":[],"lastnames":["Rocha"],"firstnames":["Bruno"],"suffixes":[]},{"propositions":[],"lastnames":["Paiva"],"firstnames":["Rui","Pedro"],"suffixes":[]},{"propositions":[],"lastnames":["Katsaggelos"],"firstnames":["Aggelos","K."],"suffixes":[]},{"propositions":[],"lastnames":["Maglaveras"],"firstnames":["Nicos"],"suffixes":[]}],"doi":"10.3390/s22031232","issn":"1424-8220","journal":"Sensors","keywords":"Asthma,CNN,COPD,Crackles,Focal loss,LSTM,Lung sounds,STFT,Wheezes","month":"feb","number":"3","pages":"1232","pmid":"35161977","publisher":"MDPI","title":"Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function","url":"https://www.mdpi.com/1424-8220/22/3/1232","volume":"22","year":"2022","bibtex":"@article{Petmezas2022a,\nabstract = {Respiratory diseases constitute one of the leading causes of death worldwide and directly affect the patient's quality of life. Early diagnosis and patient monitoring, which conventionally include lung auscultation, are essential for the efficient management of respiratory diseases. Manual lung sound interpretation is a subjective and time-consuming process that requires high medical expertise. The capabilities that deep learning offers could be exploited in order that robust lung sound classification models can be designed. In this paper, we propose a novel hybrid neural model that implements the focal loss (FL) function to deal with training data imbalance. Features initially extracted from short-time Fourier transform (STFT) spectrograms via a convolutional neural network (CNN) are given as input to a long short-term memory (LSTM) network that memorizes the temporal dependencies between data and classifies four types of lung sounds, including normal, crackles, wheezes, and both crackles and wheezes. The model was trained and tested on the ICBHI 2017 Respiratory Sound Database and achieved state-of-the-art results using three different data splitting strategies—namely, sensitivity 47.37%, specificity 82.46%, score 64.92% and accuracy 73.69% for the official 60/40 split, sensitivity 52.78%, specificity 84.26%, score 68.52% and accuracy 76.39% using interpatient 10-fold cross validation, and sensitivity 60.29% and accuracy 74.57% using leave-one-out cross validation.},\nauthor = {Petmezas, Georgios and Cheimariotis, Grigorios-Aris and Stefanopoulos, Leandros and Rocha, Bruno and Paiva, Rui Pedro and Katsaggelos, Aggelos K. and Maglaveras, Nicos},\ndoi = {10.3390/s22031232},\nissn = {1424-8220},\njournal = {Sensors},\nkeywords = {Asthma,CNN,COPD,Crackles,Focal loss,LSTM,Lung sounds,STFT,Wheezes},\nmonth = {feb},\nnumber = {3},\npages = {1232},\npmid = {35161977},\npublisher = {MDPI},\ntitle = {{Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function}},\nurl = {https://www.mdpi.com/1424-8220/22/3/1232},\nvolume = {22},\nyear = {2022}\n}\n","author_short":["Petmezas, G.","Cheimariotis, G.","Stefanopoulos, L.","Rocha, B.","Paiva, R. P.","Katsaggelos, A. K.","Maglaveras, N."],"key":"Petmezas2022a","id":"Petmezas2022a","bibbaseid":"petmezas-cheimariotis-stefanopoulos-rocha-paiva-katsaggelos-maglaveras-automatedlungsoundclassificationusingahybridcnnlstmnetworkandfocallossfunction-2022","role":"author","urls":{"Paper":"https://www.mdpi.com/1424-8220/22/3/1232"},"keyword":["Asthma","CNN","COPD","Crackles","Focal loss","LSTM","Lung sounds","STFT","Wheezes"],"metadata":{"authorlinks":{}}},"bibtype":"article","biburl":"https://sites.northwestern.edu/ivpl/files/2023/06/IVPL_Updated_publications-1.bib","dataSources":["KTWAakbPXLGfYseXn","ePKPjG8C6yvpk4mEK","ya2CyA73rpZseyrZ8","37Qfzv6wRptYkoSCL","zFPgsTDAW8aDnb5iN","E6Bth2QB5BYjBMZE7","nbnEjsN7MJhurAK9x","PNQZj6FjzoxxJk4Yi","7FpDWDGJ4KgpDiGfB","bod9ms4MQJHuJgPpp","QR9t5P2cLdJuzhfzK","D8k2SxfC5dKNRFgro","7Dwzbxq93HWrJEhT6","qhF8zxmGcJfvtdeAg","fvDEHD49E2ZRwE3fb","H7crv8NWhZup4d4by","DHqokWsryttGh7pJE","vRJd4wNg9HpoZSMHD","sYxQ6pxFgA59JRhxi","w2WahSbYrbcCKBDsC","XasdXLL99y5rygCmq","3gkSihZQRfAD2KBo3","t5XMbyZbtPBo4wBGS","bEpHM2CtrwW2qE8FP","CEvLkyL7ktgXkyLsM","E9G7xnNefvBN7vme5","e6n4tghcdNGQjnXfe","mgahWxMFCzYgy2PBX","teJzFLHexaz5AQW5z"],"keywords":["asthma","cnn","copd","crackles","focal loss","lstm","lung sounds","stft","wheezes"],"search_terms":["automated","lung","sound","classification","using","hybrid","cnn","lstm","network","focal","loss","function","petmezas","cheimariotis","stefanopoulos","rocha","paiva","katsaggelos","maglaveras"],"title":"Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function","year":2022}