Loading...
Please wait, while we are loading the content...
Similar Documents
Arabic Named Entity Recognition Using Topic Modeling
| Content Provider | Semantic Scholar |
|---|---|
| Author | Bazi, Ismail El Laachfoubi, Nabil |
| Copyright Year | 2018 |
| Abstract | In this article, we introduce novel features for Arabic Named Entity Recognition (NER) based on Latent Dirichlet Allocation (LDA), a widely used topic modeling technique. We investigate and analyze three different approaches for utilizing LDA, including two newly proposed ones, namely Topical Prototypes approach and Topical Word Embeddings approach. Our Experiments show that each of the presented approaches improves the baseline features, among which the Word-Class LDA approach performs the best. Moreover, the combination of these topic modeling approaches provides additive improvements, outperforming traditional word representations as Skip-gram word embeddings and Brown Clustering. The proposed LDA-based features, learned in an unsupervised way, are fully language-independent and have proven to be very effective to enrich and boost NER models for Arabic, a morphologically rich language. |
| Starting Page | 229 |
| Ending Page | 238 |
| Page Count | 10 |
| File Format | PDF HTM / HTML |
| Volume Number | 11 |
| Alternate Webpage(s) | http://www.inass.org/2018/2018022824.pdf |
| Alternate Webpage(s) | https://doi.org/10.22266/ijies2018.0228.24 |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |