论文标题
探索众包呼吸声数据的COVID-19自动诊断
Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data
论文作者
论文摘要
人体产生的音频信号(例如,叹息,呼吸,心脏,消化,振动声音)通常被临床医生用作诊断疾病或评估疾病进展的指标。直到最近,这些信号通常是通过预定访问时通过手动听诊来收集的。现在,研究已经开始使用数字技术来收集身体声音(例如,从数字听诊器)进行心血管或呼吸检查,然后可以自动分析。一些最初的工作表明,从语音和咳嗽中检测Covid-19的诊断信号有望。在本文中,我们描述了我们对收集的呼吸道声音的大规模众包数据集的数据分析,以帮助诊断Covid-19。我们使用咳嗽和呼吸来了解哮喘或健康对照中的Covid-19声音如何可见。我们的结果表明,即使是简单的二进制机器学习分类器,也能够正确地对健康和共证的声音进行分类。我们还展示了如何区分对Covid-19测试阳性并患有咳嗽的健康用户的用户,以及对Covid-19的呈阳性并伴有哮喘和咳嗽的用户咳嗽的用户。我们的模型在所有任务中达到了80%以上的AUC。这些结果是初步的,仅刮擦了这种类型的数据和基于音频的机器学习的潜力。这项工作为进一步研究如何将自动分析的呼吸模式用作筛选前信号开门打开了大门,以帮助Covid-19诊断。
Audio signals generated by the human body (e.g., sighs, breathing, heart, digestion, vibration sounds) have routinely been used by clinicians as indicators to diagnose disease or assess disease progression. Until recently, such signals were usually collected through manual auscultation at scheduled visits. Research has now started to use digital technology to gather bodily sounds (e.g., from digital stethoscopes) for cardiovascular or respiratory examination, which could then be used for automatic analysis. Some initial work shows promise in detecting diagnostic signals of COVID-19 from voice and coughs. In this paper we describe our data analysis over a large-scale crowdsourced dataset of respiratory sounds collected to aid diagnosis of COVID-19. We use coughs and breathing to understand how discernible COVID-19 sounds are from those in asthma or healthy controls. Our results show that even a simple binary machine learning classifier is able to classify correctly healthy and COVID-19 sounds. We also show how we distinguish a user who tested positive for COVID-19 and has a cough from a healthy user with a cough, and users who tested positive for COVID-19 and have a cough from users with asthma and a cough. Our models achieve an AUC of above 80% across all tasks. These results are preliminary and only scratch the surface of the potential of this type of data and audio-based machine learning. This work opens the door to further investigation of how automatically analysed respiratory patterns could be used as pre-screening signals to aid COVID-19 diagnosis.