论文标题
实时面孔和具有里程碑意义的本地化,以进行震呼检测
Real-Time Face and Landmark Localization for Eyeblink Detection
论文作者
论文摘要
Pavlovian Eykblink条件是一种在神经科学领域中使用的强大实验,用于测量我们在日常生活中学习的多个方面。为了在实验过程中跟踪眼睑的运动,研究人员传统上使用电位仪或肌电图。最近,计算机视觉和图像处理的使用减轻了对这些技术的需求,但是目前采用的方法需要人为干预,并且不够快,无法实现实时处理。在这项工作中,已经仔细组合了面部和地标检测算法,以提供全自动的眼睑跟踪,并进一步加速以迈向在线闭环实验的第一步。到目前为止,此类实验尚未实现,并有望在神经和精神疾病的运作中提供重要的见解。根据广泛的文献搜索,已经分析和评估了各种不同的面部检测算法和地标检测算法。两种算法被鉴定为最适合眼睑检测:用于面部检测的直方图(HOG)算法和用于地标检测的整体回归树(ERT)算法。这两种算法已在GPU和CPU上加速,分别达到1,753 $ \ times $和11 $ \ times $的加速度。为了证明眼睑检测算法的有用性,形成了研究假设,并采用了良好的神经科学实验:Eykeblink检测。我们的实验评估显示,每帧的总体应用运行时为0.533毫秒,比顺序实现快1,101 $ \ times $,并且符合人类的千旋通际条件的实时要求,即每秒更快的速度超过500帧。
Pavlovian eyeblink conditioning is a powerful experiment used in the field of neuroscience to measure multiple aspects of how we learn in our daily life. To track the movement of the eyelid during an experiment, researchers have traditionally made use of potentiometers or electromyography. More recently, the use of computer vision and image processing alleviated the need for these techniques but currently employed methods require human intervention and are not fast enough to enable real-time processing. In this work, a face- and landmark-detection algorithm have been carefully combined in order to provide fully automated eyelid tracking, and have further been accelerated to make the first crucial step towards online, closed-loop experiments. Such experiments have not been achieved so far and are expected to offer significant insights in the workings of neurological and psychiatric disorders. Based on an extensive literature search, various different algorithms for face detection and landmark detection have been analyzed and evaluated. Two algorithms were identified as most suitable for eyelid detection: the Histogram-of-Oriented-Gradients (HOG) algorithm for face detection and the Ensemble-of-Regression-Trees (ERT) algorithm for landmark detection. These two algorithms have been accelerated on GPU and CPU, achieving speedups of 1,753$\times$ and 11$\times$, respectively. To demonstrate the usefulness of our eyelid-detection algorithm, a research hypothesis was formed and a well-established neuroscientific experiment was employed: eyeblink detection. Our experimental evaluation reveals an overall application runtime of 0.533 ms per frame, which is 1,101$\times$ faster than the sequential implementation and well within the real-time requirements of eyeblink conditioning in humans, i.e. faster than 500 frames per second.