Relax with CouchDB - Into the non-relational DBMS era of bioinformatics

Ganiraju Manyam; Michelle A. Payton; Jack A. Roth; Lynne V. Abruzzo; Kevin R. Coombes

doi:10.1016/j.ygeno.2012.05.006

Relax with CouchDB - Into the non-relational DBMS era of bioinformatics

Ganiraju Manyam, Michelle A. Payton, Jack A. Roth, Lynne V. Abruzzo, Kevin R. Coombes

Research output: Contribution to journal › Article › peer-review

27 Scopus citations

Abstract

With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.

Original language	English (US)
Pages (from-to)	1-7
Number of pages	7
Journal	Genomics
Volume	100
Issue number	1
DOIs	https://doi.org/10.1016/j.ygeno.2012.05.006
State	Published - Jul 2012
Externally published	Yes

Keywords

Copy number variation
Data integration
Drug-target interaction
NoSQL database

ASJC Scopus subject areas

Genetics

Access to Document

10.1016/j.ygeno.2012.05.006

Cite this

@article{4b08ed5b4a614c1388e4b1b70afbcde7,

title = "Relax with CouchDB - Into the non-relational DBMS era of bioinformatics",

abstract = "With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.",

keywords = "Copy number variation, Data integration, Drug-target interaction, NoSQL database",

author = "Ganiraju Manyam and Payton, {Michelle A.} and Roth, {Jack A.} and Abruzzo, {Lynne V.} and Coombes, {Kevin R.}",

note = "Funding Information: This work was supported in part by grants R01 CA123252 , P30 CA016672 , and P50 CA070907 from the National Cancer Institute of the National Institutes of Health . ",

year = "2012",

month = jul,

doi = "10.1016/j.ygeno.2012.05.006",

language = "English (US)",

volume = "100",

pages = "1--7",

journal = "Genomics",

issn = "0888-7543",

publisher = "Academic Press Inc.",

number = "1",

}

TY - JOUR

T1 - Relax with CouchDB - Into the non-relational DBMS era of bioinformatics

AU - Manyam, Ganiraju

AU - Payton, Michelle A.

AU - Roth, Jack A.

AU - Abruzzo, Lynne V.

AU - Coombes, Kevin R.

N1 - Funding Information: This work was supported in part by grants R01 CA123252 , P30 CA016672 , and P50 CA070907 from the National Cancer Institute of the National Institutes of Health .

PY - 2012/7

Y1 - 2012/7

N2 - With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.

AB - With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.

KW - Copy number variation

KW - Data integration

KW - Drug-target interaction

KW - NoSQL database

UR - http://www.scopus.com/inward/record.url?scp=84862753629&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84862753629&partnerID=8YFLogxK

U2 - 10.1016/j.ygeno.2012.05.006

DO - 10.1016/j.ygeno.2012.05.006

M3 - Article

C2 - 22609849

AN - SCOPUS:84862753629

SN - 0888-7543

VL - 100

SP - 1

EP - 7

JO - Genomics

JF - Genomics

IS - 1

ER -

Relax with CouchDB - Into the non-relational DBMS era of bioinformatics

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this