Practical application of ontologies to annotate and analyse large scale raw mouse phenotype data

Cited by: 36
Authors
Beck, Tim [1]
Morgan, Hugh [1]
Blake, Andrew [1]
Wells, Sara [1]
Hancock, John M. [1]
Mallon, Ann-Marie [1]
Affiliation
[1] MRC Harwell, Didcot OX11 0RD, Oxon, England
Source
BMC BIOINFORMATICS | 2009 / Vol. 10
Funding
UK Medical Research Council;
Keywords
TOOL; MUTAGENESIS; EMPRESS;
DOI
10.1186/1471-2105-10-S5-S2
Chinese Library Classification
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Background: Large-scale international projects are underway to generate collections of knockout mouse mutants and subsequently to perform high-throughput phenotype assessments, raising new challenges for computational researchers due to the complexity and scale of the phenotype data. Phenotypes can be described using ontologies in two differing methodologies. Traditionally, an individual phenotypic character has been defined either using a single compound term, originating from a species-specific dedicated phenotype ontology, or by a combinatorial annotation, using concepts from a range of disparate ontologies, to define a phenotypic character as an entity with an associated quality (EQ). Both methods have their merits: the dedicated approach allows use of community-standard terminology, and the combinatorial approach facilitates cross-species comparison of phenotypic statements. Previously, databases have favoured one approach over the other. The EUMODIC project will generate large amounts of mouse phenotype data by executing a set of Standard Operating Procedures (SOPs) and will implement both ontological approaches to capture these data.
Results: For all SOPs a four-tier annotation is made: a high-level description of the SOP, to broadly define the type of data generated by the SOP; individual parameter annotation using the EQ model; annotation of the qualitative data generated for each mouse; and the annotation of mutant lines after statistical analysis. The qualitative assessments of phenodeviance are made at the point of data entry, using PATO qualities that are children of the parameter quality. To facilitate data querying by scientists more familiar with single compound terms to describe phenotypes, the mappings between the Mammalian Phenotype (MP) ontology and the EQ PATO model are exploited to allow querying via MP terms.
Conclusion: Well-annotated and comparable phenotype databases can be achieved through the use of ontologically derived comparable phenotypic statements and have been implemented here by means of OBO compatible EQ annotations. The implementation we describe also sees scientists working seamlessly with ontologies through the assessment of qualitative phenotypes in terms of PATO qualities and the ability to query the database using community-accepted compound MP terms. This work represents the first time the combinatorial and single-dedicated approaches have both been implemented to annotate a phenotypic dataset.
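As an illustration only, the idea of querying EQ annotations through MP terms can be sketched in a few lines of Python. This is not the EUMODIC implementation: the `EQ` class, the `query_by_mp` function, the mouse identifiers, and every term identifier below are hypothetical placeholders introduced for the example, not real ontology IDs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EQ:
    """An entity-quality statement: an entity term paired with a PATO quality."""
    entity: str
    quality: str


# Hypothetical placeholder identifiers; these are NOT real ontology term IDs.
DECREASED_WEIGHT = EQ("ENTITY:EXAMPLE_whole_body", "PATO:EXAMPLE_decreased_weight")
INCREASED_WEIGHT = EQ("ENTITY:EXAMPLE_whole_body", "PATO:EXAMPLE_increased_weight")

# Assumed mapping from a single compound MP term to its EQ decomposition.
MP_TO_EQ = {
    "MP:EXAMPLE_decreased_body_weight": DECREASED_WEIGHT,
    "MP:EXAMPLE_increased_body_weight": INCREASED_WEIGHT,
}

# Hypothetical per-mouse qualitative annotations stored as EQ statements.
ANNOTATIONS = [
    ("mouse-001", DECREASED_WEIGHT),
    ("mouse-002", INCREASED_WEIGHT),
    ("mouse-003", DECREASED_WEIGHT),
]


def query_by_mp(mp_term, mapping=MP_TO_EQ, annotations=ANNOTATIONS):
    """Return the mice whose EQ annotation matches the EQ form of an MP term."""
    eq = mapping.get(mp_term)
    if eq is None:
        # Unknown MP term: nothing can be matched.
        return []
    return [mouse for mouse, annotated in annotations if annotated == eq]
```

In this sketch the data are stored only as EQ statements, and the MP term serves purely as a query key that is translated to EQ before matching, which mirrors the division of labour the abstract describes.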
Pages: 9