Corpus Wide Argument Mining - A Working Solution

Liat Ein-Dor; Eyal Shnarch; Lena Dankin; Alon Halfon; Benjamin Sznajder; Ariel Gera; Carlos Alzate; Martin Gleize; Leshem Choshen; Yufang Hou; Yonatan Bilu; Ranit Aharonov; Noam Slonim

AAAI 2020

Conference paper

07 Feb 2020

Corpus Wide Argument Mining - A Working Solution

Download paper

Abstract

One of the main tasks in argument mining is the retrieval of argumentative content pertaining to a given topic. Most previous work addressed this task by retrieving a relatively small number of relevant documents as the initial source for such content. This line of research yielded moderate success, which is of limited use in a real-world system. Furthermore, for such a system to yield a comprehensive set of relevant arguments, over a wide range of topics, it requires leveraging a large and diverse corpus in an appropriate manner. Here we present a first end-To-end high-precision, corpus-wide argument mining system. This is made possible by combining sentence-level queries over an appropriate indexing of a very large corpus of newspaper articles, with an iterative annotation scheme. This scheme addresses the inherent label bias in the data and pinpoints the regions of the sample space whose manual labeling is required to obtain high-precision among top-ranked candidates.

Conference paper