Open Research Newcastle
Browse

Querying semistructured data with compression in distributed environments

Download (345.73 kB)
conference contribution
posted on 2025-05-11, 22:45 authored by B. M. Monjurul Alom, Frans HenskensFrans Henskens, Michael Hannaford
As data management applications grow more complex, they are likely to need efficient distributed query processing. In Distributed Database Systems complete replication consists of maintaining complete copies of the database at each site; this has advantages such as highest locality of reference, highest reliability, availability, and is best for reading. The most promising and dominant data format for data processing and representing on the Internet is the semistructured data form termed XML. XML data has no fixed schema; it evolved and is self describing which results in management difficulties compared to, for example relational data. It is therefore a major challenge for the database community to design query languages and storage methods that can retrieve semistructured data. In this paper, we present a storing and querying scheme for semistructured data views of relational form in distributed environments. The proposed technique stores path dictionary, word dictionary, attribute dictionary, and the complete compressed replication of semistructured data in each distributed site of the DDBS. The presented technique provides query performance improvement due to the compression of semistructured data.

History

Source title

Proceedings of the 6th International Conference on Information Technology: New Generations 2009

Name of conference

6th International Conference on Information Technology: New Generations, 2009 (ITNG '09)

Location

Las Vegas, NV

Start date

2009-04-27

End date

2009-04-29

Pagination

1546-1553

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Place published

Piscataway, NJ

Language

  • en, English

College/Research Centre

Faculty of Engineering and Built Environment

School

School of Electrical Engineering and Computer Science

Rights statement

Copyright © 2009 IEEE. Reprinted from the Proceedings of the 6th International Conference on Information Technology: New Generations 2009. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of University of Newcastle's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Usage metrics

    Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC