A portable method for acquiring information extraction patterns without annotated corpora
| Content Provider | Scilit |
|---|---|
| Author | Català, Neus; Castell, Núria; Martín, Mario |
| Copyright Year | 2003 |
| Description | The main issue when building Information Extraction (IE) systems is how to obtain the knowledge needed to identify relevant information in a document. Most approaches require expert human intervention in many steps of the acquisition process. In this paper we describe ESSENCE, a new method for acquiring IE patterns that significantly reduces the need for human intervention. The method is based on ELA, a specifically designed learning algorithm for acquiring IE patterns without tagged examples. The distinctive features of ESSENCE and ELA are that (1) they permit the automatic acquisition of IE patterns from unrestricted and untagged text representative of the domain, due to (2) their ability to identify regularities around semantically relevant concept-words for the IE task by (3) using non-domain-specific lexical knowledge tools such as WordNet, and (4) restricting the human intervention to defining the task, and validating and typifying the set of IE patterns obtained. Since ESSENCE does not require a corpus annotated with the type of information to be extracted and it uses a general purpose ontology and widely applied syntactic tools, it reduces the expert effort required to build an IE system and therefore also reduces the effort of porting the method to any domain. The results of the application of ESSENCE to the acquisition of IE patterns in an MUC-like task are shown. |
| Related Links | https://www.cambridge.org/core/services/aop-cambridge-core/content/view/F8CE7B7766606D3F915C5D7CE1E285CA/S1351324902003042a.pdf/div-class-title-a-portable-method-for-acquiring-information-extraction-patterns-without-annotated-corpora-div.pdf |
| Ending Page | 179 |
| Page Count | 29 |
| Starting Page | 151 |
| ISSN | 1351-3249 |
| e-ISSN | 1469-8110 |
| DOI | 10.1017/s1351324902003042 |
| Journal | Natural Language Engineering |
| Issue Number | 2 |
| Volume Number | 9 |
| Language | English |
| Publisher | Cambridge University Press (CUP) |
| Publisher Date | 2003-06-01 |
| Access Restriction | Open |
| Subject Keyword | Natural Language Engineering; Language Studies; Information Extraction; Human Intervention; Method for Acquiring; Acquiring IE Patterns |
| Content Type | Text |
| Resource Type | Article |
| Subject | Artificial Intelligence; Linguistics and Language; Software |