Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Paper Title

Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Authors

Zhao, Running; Yu, Jiangtao; Li, Tingle; Zhao, Hang; Ngai, Edith C. H.

Abstract

Because microphones are easily affected by noise and soundproof materials, radio frequency (RF) signals are a promising candidate for recovering audio: they are immune to noise and can traverse many soundproof objects. In this paper, we introduce Radio2Speech, a system that uses RF signals to recover high quality speech from a loudspeaker. Radio2Speech recovers speech comparable in quality to that of a microphone, advancing beyond existing approaches that recover only single-tone music or incomprehensible speech. We use Radio UNet to accurately recover speech in the time-frequency domain from RF signals with a limited frequency band. We also incorporate a neural vocoder to synthesize the speech waveform from the estimated time-frequency representation without using the contaminated phase. Quantitative and qualitative evaluations show that in quiet, noisy, and soundproof scenarios, Radio2Speech achieves state-of-the-art performance and is on par with a microphone working in quiet scenarios.
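
As a rough illustration of what a phase-free time-frequency representation looks like, the sketch below computes a short-time Fourier transform of a test tone and keeps only the magnitude. It is not the authors' implementation: the function names `hann` and `stft_magnitude`, the frame length, the hop size, and the test signal are all assumptions chosen for illustration, and the naive DFT is used only to keep the example dependency-free. Radio UNet and the neural vocoder mentioned in the abstract are not shown.

```python
import cmath
import math

def hann(n):
    """Symmetric Hann window of length n (n > 1)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]

def stft_magnitude(signal, frame_len=64, hop=32):
    """Return a list of frames, each holding magnitudes for bins 0..frame_len//2.

    The complex spectrum of each windowed frame is computed with a naive DFT,
    and only its magnitude is kept (the phase is dropped).
    """
    window = hann(frame_len)
    n_bins = frame_len // 2 + 1
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        mags = []
        for k in range(n_bins):
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * i / frame_len)
                for i, x in enumerate(frame)
            )
            mags.append(abs(coeff))  # magnitude only
        frames.append(mags)
    return frames

# Hypothetical test signal: a 1 kHz tone sampled at 8 kHz for 0.1 s.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000.0 * n / sample_rate) for n in range(800)]
mag = stft_magnitude(tone)

# Average each frequency bin over time and locate the strongest one.
avg = [sum(f[k] for f in mag) / len(mag) for k in range(len(mag[0]))]
peak_bin = max(range(len(avg)), key=avg.__getitem__)
peak_hz = peak_bin * sample_rate / 64  # bin spacing = sample_rate / frame_len
```

In a pipeline of the kind the abstract describes, a magnitude representation like `mag` would be the quantity estimated from the RF input, and a vocoder would then synthesize the waveform from it without relying on the phase.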
