What Is a Paraphrase?

Cited by: 126
Authors
Bhagat, Rahul [1]
Hovy, Eduard [1]
Affiliations
[1] USC Information Sciences Institute, Los Angeles, CA, USA
DOI
10.1162/COLI_a_00166
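Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
140502 [Artificial intelligence]
Abstract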
Paraphrases are sentences or phrases that convey the same meaning using different wording. Although the logical definition of paraphrases requires strict semantic equivalence, linguistics accepts a broader, approximate equivalence, thereby allowing far more examples of quasi-paraphrase. But approximate equivalence is hard to define. Thus, the phenomenon of paraphrases, as understood in linguistics, is difficult to characterize. In this article, we list a set of 25 operations that generate quasi-paraphrases. We then empirically validate the scope and accuracy of this list by manually analyzing random samples of two publicly available paraphrase corpora. We provide the distribution of naturally occurring quasi-paraphrases in English text.
Pages: 463-472
Page count: 10