Identification of Personal Name Aliases on the Web
| Content Provider | Semantic Scholar |
|---|---|
| Author | Bollegala, Danushka; Honma, Taiki; Matsuo, Yutaka; Ishizuka, Mitsuru |
| Copyright Year | 2008 |
| Abstract | Extracting aliases of an entity is important for various tasks such as identification of relations among entities, web search and entity disambiguation. To extract relations among entities properly, one must first identify those entities. We propose a novel approach to find aliases of a given name using automatically extracted lexical patterns. We exploit a set of known names and their aliases as training data and extract lexical patterns that convey information related to aliases of names from text snippets returned by a web search engine. The patterns are then used to find candidate aliases of a given name. We use anchor texts to design a word co-occurrence model and use it to define various ranking scores to measure the association between a name and a candidate alias. The ranking scores are integrated with page-count-based association measures using support vector machines to leverage a robust alias detection method. The proposed method outperforms numerous baselines and previous work on alias extraction on a dataset of personal names, achieving a statistically significant mean reciprocal rank of 0.6718. Experiments carried out using a dataset of location names and Japanese personal names suggest the possibility of extending the proposed method to extract aliases for different types of named entities and for other languages. Moreover, the aliases extracted using the proposed method improve recall by 20% in a relation-detection task. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://danushka.net/papers/SWSM08.pdf |
| Alternate Webpage(s) | http://keg.cs.tsinghua.edu.cn/SWSM2008/regular%20papers/swsm08_submission_11.pdf |
| Alternate Webpage(s) | http://www.miv.t.u-tokyo.ac.jp/papers/danushka-WWW08-SWSM08.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |
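
The abstract evaluates alias ranking with mean reciprocal rank (MRR). As a rough illustration of that metric only (not the authors' evaluation code), the sketch below uses the standard definition: for each name, take the reciprocal of the rank of the first correct alias in the ranked candidate list, score 0 if none appears, and average over all names. The function name, the dictionary layout, and the toy names are assumptions made up for this example.

```python
def mean_reciprocal_rank(ranked_candidates, gold_aliases):
    """Average of 1/rank of the first correct alias per name.

    ranked_candidates: dict mapping a name to its ranked list of candidate aliases
    gold_aliases: dict mapping a name to the set of its known correct aliases
    (Both structures are hypothetical, chosen for illustration.)
    """
    if not ranked_candidates:
        return 0.0
    total = 0.0
    for name, candidates in ranked_candidates.items():
        gold = gold_aliases.get(name, set())
        for rank, candidate in enumerate(candidates, start=1):
            if candidate in gold:
                total += 1.0 / rank  # only the first correct hit counts
                break
        # a name with no correct candidate contributes 0
    return total / len(ranked_candidates)


# Toy, made-up data: three names with ranked alias candidates.
ranked = {
    "name_a": ["x", "y"],  # first correct alias at rank 2
    "name_b": ["p"],       # first correct alias at rank 1
    "name_c": ["q", "r"],  # no correct alias in the list
}
gold = {"name_a": {"y"}, "name_b": {"p"}, "name_c": {"z"}}

mrr = mean_reciprocal_rank(ranked, gold)
print(mrr)
```

On this toy input the per-name reciprocal ranks are 1/2, 1, and 0, so a value near 1 means the correct alias is usually ranked first.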