A Survey on Unsupervised Extraction of Product Information from Semi-Structured Sources
| Content Provider | Semantic Scholar |
|---|---|
| Author | Bhagat, Abhilasha Ramdas; Raut, Vanita |
| Copyright Year | 2015 |
| Abstract | Nowadays, searching for product information has become one of the most important application areas of the Internet. However, the large amount of product data available on the web, and its varied representations, can easily overwhelm potential customers. Online product information is generally in a semi-structured format because it is presented through template-generated HTML pages. Such pages follow only an implicit data schema, since information and presentation are mixed together. The goal of the information extraction task is to make product information efficiently accessible in a structured manner. The proposed technique is based on a clustering approach that uses structural and visual features of web page elements. The extracted information allows users to compare products effectively while sparing them the manual extraction work. In this survey paper, we describe various wrapper generation systems, such as RoadRunner, SG-WRAP, X-WRAP, and DeLA, in detail, along with their advantages and limitations. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://www.ijcsit.com/docs/Volume%206/vol6issue03/ijcsit20150603199.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |