Loading...
Please wait, while we are loading the content...
Similar Documents
Linguistic Annotation of Translated Chinese Texts: Coordinating Theory, Algorithms and Data
| Content Provider | Scilit |
|---|---|
| Author | Semenov, Kirill I. Titizian, Armine K. Piskunova, Aleksandra O. Korotkova, Yulia O. Tsvetkova, Alena D. Volf, Elena A. Konovalova, Alexandra S. Kuznetsova, Yulia N. |
| Copyright Year | 2021 |
| Abstract | The article tackles the problems of linguistic annotation in the Chinese texts presented in the Ruzhcorp – Russian-Chinese Parallel Corpus of RNC, and the ways to solve them. Particular attention is paid to the processing of Russian loanwords. On the one hand, we present the theoretical comparison of the widespread standards of Chinese text processing. On the other hand, we describe our experiments in three fields: word segmentation, grapheme-to-phoneme conversion, and PoS-tagging, on the specific corpus data that contains many transliterations and loanwords. As a result, we propose the preprocessing pipeline of the Chinese texts, that will be implemented in Ruzhcorp. |
| Related Links | https://www.sciendo.com/pdf/10.2478/jazcas-2021-0054 |
| Ending Page | 602 |
| Page Count | 13 |
| Starting Page | 590 |
| ISSN | 00215597 |
| e-ISSN | 13384287 |
| DOI | 10.2478/jazcas-2021-0054 |
| Journal | Journal of Linguistics/Jazykovedný casopis |
| Issue Number | 2 |
| Volume Number | 72 |
| Language | English |
| Publisher | Walter de Gruyter GmbH |
| Publisher Date | 2021-12-01 |
| Access Restriction | Open |
| Subject Keyword | Journal of Linguistics/jazykovedný Casopis Language Studies Annotation Chinese Texts Linguistic Conversion Coordinating Word Theoretical Pos Journal: Journal of Linguistics/Jazykovedný casopis, Vol- 72, Issue- 1 |
| Content Type | Text |
| Resource Type | Article |
| Subject | Linguistics and Language |