Bioinformatics for personal genome interpretation

被引:56
作者
Capriotti, Emidio [1 ,2 ]
Nehrt, Nathan L. [3 ]
Kann, Maricel G. [4 ]
Bromberg, Yana [5 ]
机构
[1] Univ Balearic Isl, Dept Math & Comp Sci, Palma De Mallorca 07122, Spain
[2] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
[3] US FDA, Res Participat Program, Rockville, MD 20857 USA
[4] Univ Maryland Baltimore Cty, Dept Biol Sci, Baltimore, MD 21250 USA
[5] Rutgers State Univ, Dept Biochem & Microbiol, Sch Environm & Biol Sci, New Brunswick, NJ 08901 USA
基金
美国国家卫生研究院;
关键词
genomic variation; genome interpretation; genomic variant databases; gene prioritization; deleterious variants; PROTEIN STABILITY CHANGES; NON-SYNONYMOUS SNPS; SINGLE NUCLEOTIDE POLYMORPHISMS; GENE PRIORITIZATION; SEMANTIC SIMILARITY; CANDIDATE GENES; WEB SERVER; IN-SILICO; NEIGHBORING GENES; SOMATIC MUTATIONS;
D O I
10.1093/bib/bbr070
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
An international consortium released the first draft sequence of the human genome 10 years ago. Although the analysis of this data has suggested the genetic underpinnings of many diseases, we have not yet been able to fully quantify the relationship between genotype and phenotype. Thus, a major current effort of the scientific community focuses on evaluating individual predispositions to specific phenotypic traits given their genetic backgrounds. Many resources aim to identify and annotate the specific genes responsible for the observed phenotypes. Some of these use intra-species genetic variability as a means for better understanding this relationship. In addition, several online resources are now dedicated to collecting single nucleotide variants and other types of variants, and annotating their functional effects and associations with phenotypic traits. This information has enabled researchers to develop bioinformatics tools to analyze the rapidly increasing amount of newly extracted variation data and to predict the effect of uncharacterized variants. In this work, we review the most important developments in the field-the databases and bioinformatics tools that will be of utmost importance in our concerted effort to interpret the human variome.
引用
收藏
页码:495 / 512
页数:18
相关论文
共 177 条
[1]   Speeding disease gene discovery by sequence based candidate prioritization [J].
Adie, EA ;
Adams, RR ;
Evans, KL ;
Porteous, DJ ;
Pickard, BS .
BMC BIOINFORMATICS, 2005, 6 (1)
[2]   SUSPECTS: enabling fast and effective prioritization of positional candidates [J].
Adie, EA ;
Adams, RR ;
Evans, KL ;
Porteous, DJ ;
Pickard, BS .
BIOINFORMATICS, 2006, 22 (06) :773-774
[3]   Prediction of human disease genes by human-mouse conserved coexpression analysis [J].
Ala, Ugo ;
Piro, Rosario Michael ;
Grassi, Elena ;
Damasco, Christian ;
Silengo, Lorenzo ;
Oti, Martin ;
Provero, Paolo ;
Di Cunto, Ferdinando .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (03)
[4]   Text mining for biology - the way forward: opinions from leading scientists [J].
Altman, Russ B. ;
Bergman, Casey M. ;
Blake, Judith ;
Blaschke, Christian ;
Cohen, Aaron ;
Gannon, Frank ;
Grivell, Les ;
Hahn, Udo ;
Hersh, William ;
Hirschman, Lynette ;
Jensen, Lars Juhl ;
Krallinger, Martin ;
Mons, Barend ;
O'Donoghue, Sean I. ;
Peitsch, Manuel C. ;
Rebholz-Schuhmann, Dietrich ;
Shatkay, Hagit ;
Valencia, Alfonso .
GENOME BIOLOGY, 2008, 9
[5]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[6]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[7]   McKusick's Online Mendelian Inheritance in Man (OMIM®) [J].
Amberger, Joanna ;
Bocchini, Carol A. ;
Scott, Alan F. ;
Hamosh, Ada .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D793-D796
[8]   In silico detection of sequence variations modifying transcriptional regulation [J].
Andersen, Malin C. ;
Engstrom, Par R. G. ;
Lithwick, Stuart ;
Arenillas, David ;
Eriksson, Per ;
Lenhard, Boris ;
Wasserman, Wyeth W. ;
Odeberg, Jacob .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (01) :0043-0054
[9]   Selective pressures at a codon-level predict deleterious mutations in human disease genes [J].
Arbiza, Leonardo ;
Duchi, Serena ;
Montaner, David ;
Burguet, Jordi ;
Pantoja-Uceda, David ;
Pineda-Lucena, Antonio ;
Dopazo, Joaquin ;
Dopazo, Hernan .
JOURNAL OF MOLECULAR BIOLOGY, 2006, 358 (05) :1390-1404
[10]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29