Ngoc Phuoc An Vo, Octavian Popescu
LREC 2016
We present a work to evaluate the hypothesis that automatic evaluation metrics developed for Machine Translation (MT) systems have significant impact on predicting semantic similarity scores in Semantic Textual Similarity (STS) task for English, in light of their usage for paraphrase identification. We show that different metrics may have different behaviors and significance along the semantic scale [0-5] of the STS task. In addition, we compare several classification algorithms using a combination of different MT metrics to build an STS system; consequently, we show that although this approach obtains state of the art result in paraphrase identification task, it is insufficient to achieve the same result in STS.
Ngoc Phuoc An Vo, Octavian Popescu
LREC 2016
Ngoc Phuoc An Vo, Simone Magnolini, et al.
SocialNLP 2015
Octavian Popescu, Ngoc Phuoc An Vo, et al.
LREC 2018
Ngoc Phuoc An Vo, Octavian Popescu
RANLP 2015