Search Results From the Web Databases Using Ontology-Assisted Data Extraction
| Content Provider | CiteSeerX |
|---|---|
| Author | Jyothi, J. Siva Reddy, Ch. Satyananada |
| Abstract | HTML form-based search interfaces have made a large number of databases accessible on the web. For human browsing, the data units from the underlying database are encoded dynamically into the result pages. For the encoded data units to be machine processable, which is important for applications such as deep web data collection and Internet applications, they must be extracted and assigned meaningful labels. The data extraction method described here, ODE (Ontology-assisted Data Extraction), automatically extracts the query result records from the HTML pages. ODE first constructs an ontology for a domain according to information matching between the query interfaces and query result pages from different web sites within the same domain. The constructed domain ontology is then used during data extraction to identify the query result section in a query result page and to align and label the data values in the extracted records. The ontology-assisted data extraction method is fully automatic and overcomes many of the deficiencies of current automatic data extraction methods. The annotation wrapper, which is constructed automatically for a particular site, can be used to annotate new result pages from the same web database. Test results indicate that the approach is effective. |
| File Format | |
| Access Restriction | Open |
| Subject Keyword | Search Result; Query Result Page; HTML Form-based Search Interface; Query Interface; Human Browsing; New Result Page; Meaningful Label; Test Result; Domain Ontology; Different Web Site; Data Unit; Data Extraction; Particular Site; Extracted Record; Internet Application; Underlying Database; Annotation Wrapper; Data Extraction Method; Data Value; Encoded Data Unit; Query Result Record; Result Page; ODE First Construct Ontology; Deep Web Data Collection; Accessible Data Extraction Method; Current Automatic Data Extraction Method; Ontology-assisted Data Extraction; Web Database; Query Result Section; Many Application; HTML Page |
| Content Type | Text |
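
The labeling step mentioned in the abstract (using a domain ontology to label the data values in extracted records) can be illustrated with a toy sketch. This is not the ODE algorithm from the paper: the `TOY_ONTOLOGY` dictionary, its regular expressions, the `label_record` function, and the sample record are all hypothetical, and a real domain ontology would be constructed by matching query interfaces against result pages rather than written by hand.

```python
import re

# Hypothetical hand-written "ontology" for a book-search domain.
# Each attribute maps to a pattern describing the values it usually takes.
# (Assumption for illustration only; not taken from the paper.)
TOY_ONTOLOGY = {
    "price": re.compile(r"^\$\d+(\.\d{2})?$"),
    "year": re.compile(r"^(19|20)\d{2}$"),
    "isbn": re.compile(r"^\d{9}[\dX]$|^\d{13}$"),
}


def label_record(values, ontology=TOY_ONTOLOGY):
    """Label each data value with the first unused attribute whose pattern matches.

    Returns (labeled, unlabeled): a dict of attribute -> value, and a list
    of values that no attribute pattern matched.
    """
    labeled = {}
    unlabeled = []
    for value in values:
        text = value.strip()
        for attribute, pattern in ontology.items():
            if attribute not in labeled and pattern.match(text):
                labeled[attribute] = text
                break
        else:
            # No attribute matched this value.
            unlabeled.append(text)
    return labeled, unlabeled


# One extracted query result record, already split into data values.
record = ["Deep Web Mining", "$39.95", "2008", "0123456789"]
labeled, unlabeled = label_record(record)
print(labeled)
print(unlabeled)
```

Values that match no attribute are kept separately rather than discarded, so that a later step (or a richer ontology) could still assign them a label.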