使用外部标签和内域预处理增强多模式变压器：可恶的模因挑战解决方案

论文标题

使用外部标签和内域预处理增强多模式变压器：可恶的模因挑战解决方案

Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution

论文作者

Zhu, Ron

论文摘要

仇恨的模因检测是最近提出的一个新的研究领域，它需要对模因的视觉，语言理解和一些背景知识才能在任务上表现良好。该技术报告总结了2020年仇恨模因检测挑战的第一名解决方案，该解决方案扩展了最先进的视觉语言变压器来解决此问题。在报告的结尾，我们还指出了改善当前方法的缺点和可能的方向。

Hateful meme detection is a new research area recently brought out that requires both visual, linguistic understanding of the meme and some background knowledge to performing well on the task. This technical report summarises the first place solution of the Hateful Meme Detection Challenge 2020, which extending state-of-the-art visual-linguistic transformers to tackle this problem. At the end of the report, we also point out the shortcomings and possible directions for improving the current methodology.

下载PDF全文

下载文献需遵守相关版权规定

论文标题