Efficient Management of XML Documents Extended Abstract
| Content Provider | CiteSeerX |
|---|---|
| Author | Lu, Hongjun |
| Abstract | XML has become a de facto standard for data representation and exchange on the World Wide Web. Unlike HTML tags, which are mainly used to describe presentation, tags in XML capture some semantics, especially when domain-specific common DTDs are used when authoring XML documents. Since industries are indeed enthusiastic about XML, and more and more XML documents have been generated, we have to deal with the issues related to efficient management of XML documents. Storing XML documents. A number of approaches have been proposed to store XML documents. Those approaches can be categorized along two dimensions. One dimension is how an XML document is modelled. XML documents can be managed as text files, by a DBMS, or by a native XML engine. When managed as text files, they are viewed as character strings. When a DBMS is used, they are transformed to conform to a specific data model, e.g., the relational model. Most native XML engines use trees to model XML documents, since elements in an XML document are ordered and strictly nested. Orthogonal to this dimension is whether the DTD is used in the storage model. For example, when XML data is stored in a relational system, the relational schema can be generated either with or without the type information of elements in the DTD. When the schema is generated from the DTDs, XML documents with different DTDs will have different schemas, hence the schema is document dependent. On the other hand, since any XML document can be modelled as an ordered tree, a relational schema that can describe the tree structure and the position of elements within it is sufficient. Using this approach, no DTD information is required, and all XML documents have the same relational schema. That is, such a schema is document independent. Recently, we conducted a benchmarking study to investigate the comparative performance of various schema mapping and storage methods implemented in three experimental XML database systems: |
| File Format | |
| Access Restriction | Open |
| Subject Keyword | XML Document; Efficient Management; Extended Abstract; Relational Schema; Text File; Native XML Engine; Ordered Tree; Type Information; XML Data; Various Schema Mapping; Benchmarking Study; HTML Tag; Relational Model; Comparative Performance; Character String; Storage Method; XML Capture; DTD Information; Document Dependent; Specific Data Model; Experimental XML Database System; Data Representation; Storage Model; Domain-specific Common DTDs; Different DTDs; Tree Structure; Relational System; Different Schema |
| Content Type | Text |
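
The document-independent approach described in the abstract can be illustrated with a minimal sketch. The abstract does not give a concrete schema, so the `node` table, its column names, and the `shred` and `children` helpers below are all assumptions made for illustration, not the schema used in the benchmarked systems. The sketch stores each element with its parent and its position among siblings, which is enough to describe an ordered tree without consulting a DTD. Attributes and mixed content are ignored to keep it short.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical document-independent schema: one row per element,
# identical for every XML document regardless of its DTD.
SCHEMA = """
CREATE TABLE node (
    id        INTEGER PRIMARY KEY,
    parent_id INTEGER,           -- NULL for a document's root element
    position  INTEGER NOT NULL,  -- 1-based order among siblings
    tag       TEXT NOT NULL,
    text      TEXT               -- leading text of the element, if any
)
"""


def shred(xml_text, conn):
    """Store every element of xml_text as a row in the node table."""
    root = ET.fromstring(xml_text)
    stack = [(root, None, 1)]  # (element, parent row id, sibling position)
    while stack:
        elem, parent_id, position = stack.pop()
        text = (elem.text or "").strip() or None
        cur = conn.execute(
            "INSERT INTO node (parent_id, position, tag, text) "
            "VALUES (?, ?, ?, ?)",
            (parent_id, position, elem.tag, text),
        )
        node_id = cur.lastrowid  # id assigned by SQLite for this element
        for pos, child in enumerate(elem, start=1):
            stack.append((child, node_id, pos))


def children(conn, parent_id):
    """Return the tags of an element's children in document order."""
    rows = conn.execute(
        "SELECT tag FROM node WHERE parent_id = ? ORDER BY position",
        (parent_id,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute(SCHEMA)
    # Two documents with unrelated structure share the same table.
    shred("<book><title>XML</title><author>Lu</author></book>", conn)
    shred("<memo><to>A</to></memo>", conn)
    book_id = conn.execute(
        "SELECT id FROM node WHERE tag = 'book'"
    ).fetchone()[0]
    print(children(conn, book_id))
```

A DTD-based mapping would instead derive one table layout per DTD (for example, a `book` table with `title` and `author` columns), which is why the abstract calls that kind of schema document dependent.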