ARE THE THISTED-EFRON AUTHORSHIP TESTS VALID

被引:15
作者
VALENZA, RJ
机构
[1] Department of Mathematics, Claremont McKenna College, Claremont, 91711, California
来源
COMPUTERS AND THE HUMANITIES | 1991年 / 25卷 / 01期
关键词
STYLOMETRY; LITERARY DETECTION; THISTED-EFRON TESTS; SHAKESPEAREAN CANON;
D O I
10.1007/BF00054287
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We assess the validity of the Thisted-Efron authorship tests in two stages. First, we construct simulated texts in accordance with the assumptions implicit in the underlying model and use these to validate the basic computations, to determine their range of applicability, and to evaluate their sensitivity to basic lexical parameters. Second, we experiment with actual texts from the Shakespearean canon and the plays of Christopher Marlowe. The results of the tests are mixed, showing good consistency for the Shakespeare plays (with some discrimination among early, middle and late works) but poor consistency between Shakespeare's poems and plays, or among Marlowe's plays.
引用
收藏
页码:27 / 46
页数:20
相关论文
共 5 条
[1]   ESTIMATING NUMBER OF UNSEEN SPECIES - HOW MANY WORDS DID SHAKESPEARE KNOW [J].
EFRON, B ;
THISTED, R .
BIOMETRIKA, 1976, 63 (03) :435-447
[2]   The relation between the number of species and the number of individuals in a random sample of an animal population [J].
Fisher, RA ;
Corbet, AS ;
Williams, CB .
JOURNAL OF ANIMAL ECOLOGY, 1943, 12 :42-58
[3]  
Press W. H., 1992, NUMERICAL RECIPES EX
[4]  
THISTED R, 1986, BIOMETRIKA, V74, P445
[5]  
THISTED RA, 1988, ELEMENTS STATISTICAL