TY - GEN
T1 - Seedeep
T2 - 21st International Conference on Scientific and Statistical Database Management, SSDBM 2009
AU - Wang, Fan
AU - Agrawal, Gagan
PY - 2009
Y1 - 2009
N2 - A recent and emerging trend in scientific data dissemination involves online databases that are hidden behind query forms, thus forming what is referred to as the deep web. In this paper, we propose SEEDEEP, a System for Exploring and quErying scientific DEEP web data sources. SEEDEEP is able to automatically mine deep web data source schemas, integrate heterogeneous data sources, answer cross-source keyword queries, and incorporates features like caching and fault-tolerance. Currently, SEEDEEP integrates 16 deep web data sources in the biological domain. We demonstrate how an integrated model for correlated deep web data sources is constructed, how a complex cross-source keyword query is answered efficiently and correctly, and how important performance issues are addressed.
AB - A recent and emerging trend in scientific data dissemination involves online databases that are hidden behind query forms, thus forming what is referred to as the deep web. In this paper, we propose SEEDEEP, a System for Exploring and quErying scientific DEEP web data sources. SEEDEEP is able to automatically mine deep web data source schemas, integrate heterogeneous data sources, answer cross-source keyword queries, and incorporates features like caching and fault-tolerance. Currently, SEEDEEP integrates 16 deep web data sources in the biological domain. We demonstrate how an integrated model for correlated deep web data sources is constructed, how a complex cross-source keyword query is answered efficiently and correctly, and how important performance issues are addressed.
UR - http://www.scopus.com/inward/record.url?scp=69049114286&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=69049114286&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-02279-1_6
DO - 10.1007/978-3-642-02279-1_6
M3 - Conference contribution
AN - SCOPUS:69049114286
SN - 3642022782
SN - 9783642022784
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 74
EP - 82
BT - Scientific and Statistical Database Management - 21st International Conference, SSDBM 2009, Proceedings
Y2 - 2 June 2009 through 4 June 2009
ER -