A rescue strategy for multimapping short sequence tags refines surveys of transcriptional activity by CAGE

被引:68
作者
Faulkner, Geoffrey J. [1 ]
Forrest, Alistair R. R. [2 ,3 ]
Chalk, Alistair M. [3 ]
Schroder, Kate [1 ]
Hayashizaki, Yoshihide [2 ]
Carninci, Piero [2 ,4 ]
Hume, David A. [5 ]
Grimmond, Sean M. [1 ]
机构
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
[2] RIKEN Yokohama Inst, RIKEN Genom Sci Ctr, Genome Explorat Res Grp Genome Network Project Co, Yokohama, Kanagawa 2300045, Japan
[3] Griffith Univ, Eskitis Inst Cell & Mol Therapies, Brisbane, Qld 4111, Australia
[4] RIKEN Wako Inst, Discovery Res Inst, Genome Sci Lab, Wako, Saitama 3510198, Japan
[5] Univ Edinburgh, Roslin Inst, Roslin EH25 9PS, Midlothian, Scotland
基金
英国医学研究理事会; 澳大利亚研究理事会;
关键词
transcriptome; promoter; CAGE; microarray;
D O I
10.1016/j.ygeno.2007.11.003
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Cap analysis gene expression (CAGE) is a high-throughput, tag-based method designed to survey the 5' end of capped full-length cDNAs. CAGE has previously been used to define global transcription start site usage and monitor gene activity in mammals. A drawback of the CAGE approach thus far has been the removal of as many as 40% of CAGE sequence tags due to their mapping to multiple genomic locations. Here, we address the origins of multimap tags and present a novel strategy to assign CAGE tags to their most likely source promoter region. When this approach was applied to the FANTOM3 CAGE libraries, the percentage of protein-coding mouse transcriptional frameworks detected by CAGE improved from 42.9 to 57.8% (an increase of 5516 frameworks) with no reduction in CAGE to microarray correlation. These results suggest that the multimap tags produced by high-throughput, short sequence tag-based approaches can be rescued to augment greatly the transcriptome coverage provided by single-map tags alone. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:281 / 288
页数:8
相关论文
共 26 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] MEME: discovering and analyzing DNA and protein sequence motifs
    Bailey, Timothy L.
    Williams, Nadya
    Misleh, Chris
    Li, Wilfred W.
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : W369 - W373
  • [3] Mice and men:: Their promoter properties
    Bajic, Vladimir B.
    Tan, Sin Lam
    Christoffels, Alan
    Schonbach, Christian
    Lipovich, Leonard
    Yang, Liang
    Hofmann, Oliver
    Kruger, Adele
    Hide, Winston
    Kai, Chikatoshi
    Kawai, Jun
    Hume, David A.
    Carninci, Piero
    Hayashizaki, Yoshihide
    [J]. PLOS GENETICS, 2006, 2 (04): : 614 - 626
  • [4] Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project
    Birney, Ewan
    Stamatoyannopoulos, John A.
    Dutta, Anindya
    Guigo, Roderic
    Gingeras, Thomas R.
    Margulies, Elliott H.
    Weng, Zhiping
    Snyder, Michael
    Dermitzakis, Emmanouil T.
    Stamatoyannopoulos, John A.
    Thurman, Robert E.
    Kuehn, Michael S.
    Taylor, Christopher M.
    Neph, Shane
    Koch, Christoph M.
    Asthana, Saurabh
    Malhotra, Ankit
    Adzhubei, Ivan
    Greenbaum, Jason A.
    Andrews, Robert M.
    Flicek, Paul
    Boyle, Patrick J.
    Cao, Hua
    Carter, Nigel P.
    Clelland, Gayle K.
    Davis, Sean
    Day, Nathan
    Dhami, Pawandeep
    Dillon, Shane C.
    Dorschner, Michael O.
    Fiegler, Heike
    Giresi, Paul G.
    Goldy, Jeff
    Hawrylycz, Michael
    Haydock, Andrew
    Humbert, Richard
    James, Keith D.
    Johnson, Brett E.
    Johnson, Ericka M.
    Frum, Tristan T.
    Rosenzweig, Elizabeth R.
    Karnani, Neerja
    Lee, Kirsten
    Lefebvre, Gregory C.
    Navas, Patrick A.
    Neri, Fidencio
    Parker, Stephen C. J.
    Sabo, Peter J.
    Sandstrom, Richard
    Shafer, Anthony
    [J]. NATURE, 2007, 447 (7146) : 799 - 816
  • [5] Carninci P, 1999, METHOD ENZYMOL, V303, P19
  • [6] Targeting a complex transcriptome: The construction of the mouse full-length cDNA encyclopedia
    Carninci, P
    Waki, K
    Shiraki, T
    Konno, H
    Shibata, K
    Itoh, M
    Aizawa, K
    Arakawa, T
    Ishii, Y
    Sasaki, D
    Bono, H
    Kondo, S
    Sugahara, Y
    Saito, R
    Osato, N
    Fukuda, S
    Sato, K
    Watahiki, A
    Hirozane-Kishikawa, T
    Nakamura, M
    Shibata, Y
    Yasunishi, A
    Kikuchi, N
    Yoshiki, A
    Kusakabe, M
    Gustincich, S
    Beisel, K
    Pavan, W
    Aidinis, V
    Nakagawara, A
    Held, WA
    Iwata, H
    Kono, T
    Nakauchi, H
    Lyons, P
    Wells, C
    Hume, DA
    Fagiolini, M
    Hensch, TK
    Brinkmeier, M
    Camper, S
    Hirota, J
    Mombaerts, P
    Muramatsu, M
    Okazaki, Y
    Kawai, J
    Hayashizaki, Y
    [J]. GENOME RESEARCH, 2003, 13 (6B) : 1273 - 1289
  • [7] High-efficiency full-length cDNA cloning by biotinylated CAP trapper
    Carninci, P
    Kvam, C
    Kitamura, A
    Ohsumi, T
    Okazaki, Y
    Itoh, M
    Kamiya, M
    Shibata, K
    Sasaki, N
    Izawa, M
    Muramatsu, M
    Hayashizaki, Y
    Schneider, C
    [J]. GENOMICS, 1996, 37 (03) : 327 - 336
  • [8] The transcriptional landscape of the mammalian genome
    Carninci, P
    Kasukawa, T
    Katayama, S
    Gough, J
    Frith, MC
    Maeda, N
    Oyama, R
    Ravasi, T
    Lenhard, B
    Wells, C
    Kodzius, R
    Shimokawa, K
    Bajic, VB
    Brenner, SE
    Batalov, S
    Forrest, ARR
    Zavolan, M
    Davis, MJ
    Wilming, LG
    Aidinis, V
    Allen, JE
    Ambesi-Impiombato, X
    Apweiler, R
    Aturaliya, RN
    Bailey, TL
    Bansal, M
    Baxter, L
    Beisel, KW
    Bersano, T
    Bono, H
    Chalk, AM
    Chiu, KP
    Choudhary, V
    Christoffels, A
    Clutterbuck, DR
    Crowe, ML
    Dalla, E
    Dalrymple, BP
    de Bono, B
    Della Gatta, G
    di Bernardo, D
    Down, T
    Engstrom, P
    Fagiolini, M
    Faulkner, G
    Fletcher, CF
    Fukushima, T
    Furuno, M
    Futaki, S
    Gariboldi, M
    [J]. SCIENCE, 2005, 309 (5740) : 1559 - 1563
  • [9] Genome-wide analysis of mammalian promoter architecture and evolution
    Carninci, Piero
    Sandelin, Albin
    Lenhard, Boris
    Katayama, Shintaro
    Shimokawa, Kazuro
    Ponjavic, Jasmina
    Semple, Colin A. M.
    Taylor, Martin S.
    Engström, Par G.
    Frith, Martin C.
    Forrest, Alistair R. R.
    Alkema, Wynand B.
    Tan, Sin Lam
    Plessy, Charles
    Kodzius, Rimantas
    Ravasi, Timothy
    Kasukawa, Takeya
    Fukuda, Shiro
    Kanamori-Katayama, Mutsumi
    Kitazume, Yayoi
    Kawaji, Hideya
    Kai, Chikatoshi
    Nakamura, Mari
    Konno, Hideaki
    Nakano, Kenji
    Mottagui-Tabar, Salim
    Arner, Peter
    Chesi, Alessandra
    Gustincich, Stefano
    Persichetti, Francesca
    Suzuki, Harukazu
    Grimmond, Sean M.
    Wells, Christine A.
    Orlando, Valerio
    Wahlestedt, Claes
    Liu, Edison T.
    Harbers, Matthias
    Kawai, Jun
    Bajic, Vladimir B.
    Hume, David A.
    Hayashizaki, Yoshihide
    [J]. NATURE GENETICS, 2006, 38 (06) : 626 - 635
  • [10] Complex loci in human and mouse genomes
    Engstrom, Par G.
    Suzuki, Harukazu
    Ninomiya, Noriko
    Akalin, Altuna
    Sessa, Luca
    Lavorgna, Giovanni
    Brozzi, Alessandro
    Luzi, Lucilla
    Tan, Sin Lam
    Yang, Liang
    Kunarso, Galih
    ng, Edwin Lian-Cho Ng
    Batalov, Serge
    Wahlestedt, Claes
    Kai, Chikatoshi
    Kawai, Jun
    Carninci, Piero
    Hayashizaki, Yoshihide
    Wells, Christine
    Bajic, Vladimir B.
    Orlando, Valerio
    Reid, James F.
    Lenhard, Boris
    Lipovich, Leonard
    [J]. PLOS GENETICS, 2006, 2 (04): : 564 - 577