Hubbry Logo
search button
Sign in
Heterogeneous database system
Heterogeneous database system
Comunity Hub
History
arrow-down
starMore
arrow-down
bob

Bob

Have a question related to this hub?

bob

Alice

Got something to say related to this hub?
Share it here.

#general is a chat channel to discuss anything related to the hub.
Hubbry Logo
search button
Sign in
Heterogeneous database system
Community hub for the Wikipedia article
logoWikipedian hub
Welcome to the community hub built on top of the Heterogeneous database system Wikipedia article. Here, you can discuss, collect, and organize anything related to Heterogeneous database system. The purpos...
Add your contribution
Heterogeneous database system

A heterogeneous database system is an automated (or semi-automated) system for the integration of heterogeneous, disparate database management systems to present a user with a single, unified query interface.

Heterogeneous database systems (HDBs) are computational models and software implementations that provide heterogeneous database integration.[1][2]

Problems of heterogeneous database integration

[edit]

This article does not contain details of distributed database management systems (sometimes known as federated database systems).

Technical heterogeneity

[edit]

Different file formats, access protocols, query languages etc. Often called syntactic heterogeneity from the point of view of data.

Data model heterogeneity

[edit]

Different ways of representing and storing the same data. Table decompositions may vary, column names (data labels) may be different (but have the same semantics), data encoding schemes may vary (i.e., should a measurement scale be explicitly included in a field or should it be implied elsewhere). Also referred as schematic heterogeneity.

Semantic heterogeneity

[edit]

Data across constituent databases may be related but different. Perhaps a database system must be able to integrate genomic and proteomic data. They are related—a gene may have several protein products—but the data are different (nucleotide sequences and amino acid sequences, or hydrophilic or -phobic amino acid sequence and positively or negatively charged amino acids). There may be many ways of looking at semantically similar, but distinct, datasets.

The system may also be required to present "new" knowledge to the user. Relationships may be inferred between data according to rules specified in domain ontologies.

See also

[edit]

References

[edit]
  1. ^ Sujansky, Walter (August 2001). "Heterogeneous Database Integration in Biomedicine". Journal of Biomedical Informatics. 34 (4): 285–298. doi:10.1006/jbin.2001.1024. PMID 11977810.
  2. ^ Sheth, Amit P.; James A. Larson (September 1990). "Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases" (PDF). ACM Computing Surveys. 22 (3): 183–236. CiteSeerX 10.1.1.381.9176. doi:10.1145/96602.96604. Archived from the original (PDF) on 2019-10-24. Retrieved 2012-07-30.