A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages

Cited by: 29
Authors
Garmire, Lana X. [1]
Garmire, David G. [2]
Huang, Wendy [3]
Yao, Joyee [3]
Glass, Christopher K. [3]
Subramaniam, Shankar [1,3]
Affiliations
[1] Univ Calif San Diego, Dept Bioengn, Jacobs Sch Engn, La Jolla, CA 92093 USA
[2] Univ Hawaii Manoa, Dept Elect Engn, Honolulu, HI 96822 USA
[3] Univ Calif San Diego, Dept Cellular & Mol Med, Sch Med, La Jolla, CA 92093 USA
Source
PLOS ONE | 2011 / Volume 6 / Issue 09
Funding
National Institutes of Health (USA);
Keywords
GENE-EXPRESSION; POL-II; IDENTIFICATION; CHIP; PREDICTION; MEDIATORS; REVEALS; CELLS;
DOI
10.1371/journal.pone.0024051
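Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code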
07 ; 0710 ; 09 ;
Abstract
Identifying diffuse signals in data from chromatin immunoprecipitation followed by high-throughput massively parallel sequencing (ChIP-Seq) poses significant computational challenges, and few methods are currently available. We present a novel global clustering approach to enrich diffuse ChIP-Seq signals of RNA polymerase II and histone 3 lysine 4 trimethylation (H3K4Me3) and apply it to identify putative long intergenic non-coding RNAs (lincRNAs) in macrophage cells. Our global clustering method compares favorably to the local clustering method SICER, which was also designed to identify diffuse ChIP-Seq signals. The validity of the algorithm is confirmed at several levels. First, 8 of 11 selected putative lincRNA regions in primary macrophages respond to lipopolysaccharide (LPS) treatment as predicted by our computational method. Second, the genes nearest to lincRNAs are enriched for biological functions related to metabolic processes under resting conditions, but for developmental and immune-related functions under LPS treatment. Third, the putative lincRNAs have conserved promoters, modestly conserved exons, and the expected predicted secondary structures. Last, they are enriched for motifs of transcription factors such as PU.1 and AP.1, previously shown to be important lineage-determining factors in macrophages, and 83% of them overlap with distal enhancer markers. In summary, the GCLS method, based on RNA polymerase II and H3K4Me3 ChIP-Seq, can effectively detect putative lincRNAs that exhibit the expected characteristics, as exemplified by macrophages in this study.
Pages: 13