论文标题
PACMAN:脉搏血氧仪数字检测和阅读的框架在低资源设置中
PACMAN: a framework for pulse oximeter digit detection and reading in a low-resource setting
论文作者
论文摘要
鉴于Covid-19的大流行,要求患者手动输入其每日氧饱和度(SPO2)和脉搏率(PR)值(脉搏率(PR)值)中的健康监测系统 - 不幸的是,这种过程趋势是打字中的错误。几项研究试图使用光学特征识别(OCR)从捕获的图像中检测生理价值。但是,该技术的可用性有限,成本高。因此,这项研究旨在提出一个名为Pacman(大流行加速的人机合作)的新型框架,并具有低资源深度学习的计算机视觉。我们比较了最先进的对象检测算法(缩放的Yolov4,Yolov5和Yolor),包括从脉搏血氧仪显示器中捕获的图像上的数字识别的商业OCR工具。所有图像均来自众包数据收集的质量和对齐方式。 Yolov5是针对所有数据集的给定模型比较的表现最佳模型,尤其是正确定向的图像数据集。我们通过数字自动取向算法进一步提高了模型性能,并应用了聚类算法来提取SPO2和PR值。 Yolov5具有实施的准确性性能约为81.0-89.5%,与没有任何其他实施的情况相比,这得到了增强。因此,这项研究强调了PACMAN框架的完成,以检测和读取现实世界数据集中的数字。该拟议的框架目前已集成到全国医院使用的患者监测系统中。
In light of the COVID-19 pandemic, patients were required to manually input their daily oxygen saturation (SpO2) and pulse rate (PR) values into a health monitoring system-unfortunately, such a process trend to be an error in typing. Several studies attempted to detect the physiological value from the captured image using optical character recognition (OCR). However, the technology has limited availability with high cost. Thus, this study aimed to propose a novel framework called PACMAN (Pandemic Accelerated Human-Machine Collaboration) with a low-resource deep learning-based computer vision. We compared state-of-the-art object detection algorithms (scaled YOLOv4, YOLOv5, and YOLOR), including the commercial OCR tools for digit recognition on the captured images from pulse oximeter display. All images were derived from crowdsourced data collection with varying quality and alignment. YOLOv5 was the best-performing model against the given model comparison across all datasets, notably the correctly orientated image dataset. We further improved the model performance with the digits auto-orientation algorithm and applied a clustering algorithm to extract SpO2 and PR values. The accuracy performance of YOLOv5 with the implementations was approximately 81.0-89.5%, which was enhanced compared to without any additional implementation. Accordingly, this study highlighted the completion of PACMAN framework to detect and read digits in real-world datasets. The proposed framework has been currently integrated into the patient monitoring system utilized by hospitals nationwide.