在科学出版物中进行视觉摘要识别的自学学习

论文标题

在科学出版物中进行视觉摘要识别的自学学习

Self-Supervised Learning for Visual Summary Identification in Scientific Publications

论文作者

Yamamoto, Shintaro, Lauscher, Anne, Ponzetto, Simone Paolo, Glavaš, Goran, Morishima, Shigeo

论文摘要

提供科学出版物的视觉摘要可以增加读者的信息访问，从而有助于应对科学出版物数量的指数增长。尽管如此，提供视觉出版物摘要的努力很少，并且主要关注生物医学领域。这主要是因为带注释的黄金标准的可用性有限，这阻碍了强大而高性能的监督学习技术的应用。为了解决这些问题，我们创建了一个新的基准数据集，用于选择数字，以作为其摘要的出版物的视觉摘要，涵盖计算机科学中的几个领域。此外，我们基于对数字字幕的数字的启发式匹配，开发了一种自我监督的学习方法。生物医学和计算机科学领域的实验表明，尽管有自我监督，因此我们的模型能够超越最先进的现状，因此不依赖任何带注释的培训数据。

Providing visual summaries of scientific publications can increase information access for readers and thereby help deal with the exponential growth in the number of scientific publications. Nonetheless, efforts in providing visual publication summaries have been few and far apart, primarily focusing on the biomedical domain. This is primarily because of the limited availability of annotated gold standards, which hampers the application of robust and high-performing supervised learning techniques. To address these problems we create a new benchmark dataset for selecting figures to serve as visual summaries of publications based on their abstracts, covering several domains in computer science. Moreover, we develop a self-supervised learning approach, based on heuristic matching of inline references to figures with figure captions. Experiments in both biomedical and computer science domains show that our model is able to outperform the state of the art despite being self-supervised and therefore not relying on any annotated training data.

下载PDF全文

下载文献需遵守相关版权规定

论文标题