PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids

Cited by: 9
Authors
Garcia-Remesal, Miguel [1, 2]
Cuevas, Alejandro [2]
Perez-Rey, David [1, 2]
Martin, Luis [2]
Anguita, Alberto [2]
de la Iglesia, Diana [2]
de la Calle, Guillermo [2]
Crespo, Jose [2, 3]
Maojo, Victor [1, 2]
Affiliations
[1] Univ Politecn Madrid, Fac Informat, Dept Inteligencia Artificial, E-28660 Madrid, Spain
[2] Univ Politecn Madrid, Fac Informat, Biomed Informat Grp, E-28660 Madrid, Spain
[3] Univ Politecn Madrid, Fac Informat, Dept Lenguajes Sistemas Informat & Ingn Software, E-28660 Madrid, Spain
DOI
10.1093/bioinformatics/btq520
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. These include, among other features, (i) searching for papers that mention one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we built from the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method that we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided.
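The two query types named in the abstract (papers mentioning given sequences, and sequences appearing in a given paper) can be illustrated with a small inverted index. The following is only a minimal sketch under stated assumptions, not the authors' extraction method or the actual PubDNA Finder implementation: the regular expression, the 12-character minimum length, the function names, and the article identifiers `PMC-A` and `PMC-B` are all hypothetical and chosen for illustration.

```python
import re
from collections import defaultdict

# Assumption: any run of at least MIN_LEN uppercase A/C/G/T/U characters is
# treated as a nucleic acid sequence. The real extraction method is not
# described in the abstract and is certainly more elaborate.
MIN_LEN = 12
SEQ_PATTERN = re.compile(r"[ACGTU]{%d,}" % MIN_LEN)


def extract_sequences(text):
    """Return distinct candidate sequences in order of first appearance."""
    found = []
    for match in SEQ_PATTERN.finditer(text):
        seq = match.group(0)
        if seq not in found:
            found.append(seq)
    return found


def build_index(articles):
    """Map each extracted sequence to the set of article IDs containing it."""
    index = defaultdict(set)
    for article_id, text in articles.items():
        for seq in extract_sequences(text):
            index[seq].add(article_id)
    return index


def articles_mentioning(index, sequences):
    """Query (i): IDs of articles that mention every given sequence."""
    id_sets = [index.get(seq, set()) for seq in sequences]
    return set.intersection(*id_sets) if id_sets else set()


def sequences_in(index, article_id):
    """Query (ii): sorted sequences that appear in the given article."""
    return sorted(seq for seq, ids in index.items() if article_id in ids)


# Hypothetical toy corpus standing in for full-text manuscripts.
articles = {
    "PMC-A": "Forward primer 5'-ATGGCCAAGTTGACCAGT-3' was used.",
    "PMC-B": "We reused ATGGCCAAGTTGACCAGT together with GGATCCTTAACGGCATTC.",
}
index = build_index(articles)

print(articles_mentioning(index, ["ATGGCCAAGTTGACCAGT"]))
print(sequences_in(index, "PMC-B"))
```

An inverted index keyed by sequence makes query (i) a set intersection, while query (ii) scans the index in this sketch; a production system would more plausibly keep a second article-to-sequences mapping as well.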
Pages: 2801-2802
Page count: 2
Related papers
1 in total
[1] GARCIAREMESAL M, 2010, P 32 ANN C IEEE ENG