Judging the significance of multiple linear regression models

被引:67
作者
Livingstone, DJ
Salt, DW
机构
[1] ChemQuest, Sandown PO36 8LZ, England
[2] Univ Portsmouth, Ctr Mol Design, Portsmouth PO1 2UP, Hants, England
[3] Univ Portsmouth, Dept Math, Portsmouth PO1 2UP, Hants, England
关键词
D O I
10.1021/jm049111p
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
It is common practice to calculate large numbers of molecular descriptors, apply variable selection procedures to reduce the numbers, and then construct multiple linear regression (MLR) models with biological activity. The significance of these models is judged using the usual statistical tests. Unfortunately, these tests are not appropriate under these circumstances since the MLR models suffer from "selection bias". Experiments with regression using random numbers have generated critical values (F-max) with which to assess significance.
引用
收藏
页码:661 / 663
页数:3
相关论文
共 6 条
[1]  
LIVINGSTONE DJ, IN PRESS REV COMPUTA
[2]  
LIVINGSTONE DJ, 1991, METHOD ENZYMOL, V203, P613
[3]   SELECTION OF SUBSETS OF REGRESSION VARIABLES [J].
MILLER, AJ .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1984, 147 :389-425
[4]   STRUCTURE-ACTIVITY-RELATIONSHIPS OF ANTIFILARIAL ANTIMYCIN ANALOGS - A MULTIVARIATE PATTERN-RECOGNITION STUDY [J].
SELWOOD, DL ;
LIVINGSTONE, DJ ;
COMLEY, JCW ;
ODOWD, AB ;
HUDSON, AT ;
JACKSON, P ;
JANDU, KS ;
ROSE, VS ;
STABLES, JN .
JOURNAL OF MEDICINAL CHEMISTRY, 1990, 33 (01) :136-142
[5]  
Todeschini R., 2008, HDB MOL DESCRIPTORS
[6]   CHANCE FACTORS IN STUDIES OF QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS [J].
TOPLISS, JG ;
EDWARDS, RP .
JOURNAL OF MEDICINAL CHEMISTRY, 1979, 22 (10) :1238-1244