Audino：音频和语音的现代注释工具

论文标题

Audino：音频和语音的现代注释工具

audino: A Modern Annotation Tool for Audio and Speech

论文作者

Grover, Manraj Singh, Bamdev, Pakhi, Brala, Ratin Kumar, Kumar, Yaman, Hama, Mika, Shah, Rajiv Ratn

论文摘要

在本文中，我们介绍了音频和语音的协作和现代注释工具：Audino。该工具允许注释者在音频中定义和描述时间分段。这些片段可以使用动态生成的形式轻松地标记和转录。管理员可以通过管理仪表板中心控制用户角色和项目分配。仪表板还启用描述标签及其值。可以轻松地以JSON格式导出注释以进行进一步分析。该工具允许音频数据及其相应的注释通过基于密钥的API上传并分配给用户。注释工具中可用的灵活性可以进行语音评分，语音活动检测（VAD），扬声器诊断，扬声器识别，语音识别，情感识别任务等方面的注释。麻省理工学院的开源许可证可用于学术和商业项目。

In this paper, we introduce a collaborative and modern annotation tool for audio and speech: audino. The tool allows annotators to define and describe temporal segmentation in audios. These segments can be labelled and transcribed easily using a dynamically generated form. An admin can centrally control user roles and project assignment through the admin dashboard. The dashboard also enables describing labels and their values. The annotations can easily be exported in JSON format for further analysis. The tool allows audio data and their corresponding annotations to be uploaded and assigned to a user through a key-based API. The flexibility available in the annotation tool enables annotation for Speech Scoring, Voice Activity Detection (VAD), Speaker Diarisation, Speaker Identification, Speech Recognition, Emotion Recognition tasks and more. The MIT open source license allows it to be used for academic and commercial projects.

下载PDF全文

下载文献需遵守相关版权规定

论文标题