WebSite Logo
  • Content
  • Similar Resources
  • Metadata
  • Cite This
  • Log-in
  • Fullscreen
Log-in
Do not have an account? Register Now
Forgot your password? Account recovery
  1. Transactions on Asian and Low-Resource Language Information Processing (TALLIP)
  2. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) : Volume 14
  3. Issue 4(Special Issue on Chinese Spell Checking), October 2015
  4. A Probabilistic Framework for Chinese Spelling Check
Loading...

Please wait, while we are loading the content...

ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) : Volume 16
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) : Volume 15
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) : Volume 14
Issue 4(Special Issue on Chinese Spell Checking), October 2015
TALLIP Perspectives: Editorial Commentary: The State of the Journal
Introduction to the Special Issue on Chinese Spell Checking
A Probabilistic Framework for Chinese Spelling Check
A Hybrid Ranking Approach to Chinese Spelling Check
Chinese Spelling Checker Based on an Inverted Index List with a Rescoring Mechanism
Correcting Chinese Spelling Errors with Word Lattice Decoding
Issue 3, June 2015
Issue 2, March 2015
Issue 1, January 2015

Similar Documents

...
A Hybrid Ranking Approach to Chinese Spelling Check

Article

...
RSpell: Retrieval-augmented Framework for Domain Adaptive Chinese Spelling Check

Article

...
A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Article

...
A hybrid language model based on a recurrent neural network and probabilistic topic modeling

Article

...
Correcting Chinese Spelling Errors with Word Lattice Decoding

Article

...
A study of language modeling for chinese spelling check.

Article

...
Chinese Spelling Checker Based on an Inverted Index List with a Rescoring Mechanism

Article

...
A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check

Article

...
A chinese ocr spelling check approach based on statistical language models ∗.

Article

A Probabilistic Framework for Chinese Spelling Check

Content Provider ACM Digital Library
Author Wang, Hsin-Min Chen, Hsin-Hsi Chen, Kuan-Yu
Copyright Year 2015
Abstract Chinese spelling check (CSC) is still an unsolved problem today since there are many homonymous or homomorphous characters. Recently, more and more CSC systems have been proposed. To the best of our knowledge, language modeling is one of the major components among these systems because of its simplicity and moderately good predictive power. After deeply analyzing the school of research, we are aware that most of the systems only employ the conventional $\textit{n}-gram$ language models. The contributions of this article are threefold. First, we propose a novel probabilistic framework for CSC, which naturally combines several important components, such as the substitution model and the language model, to inherit their individual merits as well as to overcome their limitations. Second, we incorporate the topic language models into the CSC system in an unsupervised fashion. The topic language models can capture the long-span semantic information from a word (character) string while the conventional $\textit{n}-gram$ language models can only preserve the local regularity information. Third, we further integrate Web resources with the proposed framework to enhance the overall performance. Our rigorously empirical experiments demonstrate the consistent and utility performance of the proposed framework in the CSC task.
Starting Page 1
Ending Page 17
Page Count 17
File Format PDF
ISSN 23754699
e-ISSN 23754702
DOI 10.1145/2826234
Volume Number 14
Issue Number 4
Journal ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)
Language English
Publisher Association for Computing Machinery (ACM)
Publisher Date 2015-11-11
Publisher Place New York
Access Restriction One Nation One Subscription (ONOS)
Subject Keyword Chinese Language model Probabilistic Spelling check Topic modeling
Content Type Text
Resource Type Article
Subject Computer Science
  • About
  • Disclaimer
  • Feedback
  • Sponsor
  • Contact
  • Chat with Us
About National Digital Library of India (NDLI)
NDLI logo

National Digital Library of India (NDLI) is a virtual repository of learning resources which is not just a repository with search/browse facilities but provides a host of services for the learner community. It is sponsored and mentored by Ministry of Education, Government of India, through its National Mission on Education through Information and Communication Technology (NMEICT). Filtered and federated searching is employed to facilitate focused searching so that learners can find the right resource with least effort and in minimum time. NDLI provides user group-specific services such as Examination Preparatory for School and College students and job aspirants. Services for Researchers and general learners are also provided. NDLI is designed to hold content of any language and provides interface support for 10 most widely used Indian languages. It is built to provide support for all academic levels including researchers and life-long learners, all disciplines, all popular forms of access devices and differently-abled learners. It is designed to enable people to learn and prepare from best practices from all over the world and to facilitate researchers to perform inter-linked exploration from multiple sources. It is developed, operated and maintained from Indian Institute of Technology Kharagpur.

Learn more about this project from here.

Disclaimer

NDLI is a conglomeration of freely available or institutionally contributed or donated or publisher managed contents. Almost all these contents are hosted and accessed from respective sources. The responsibility for authenticity, relevance, completeness, accuracy, reliability and suitability of these contents rests with the respective organization and NDLI has no responsibility or liability for these. Every effort is made to keep the NDLI portal up and running smoothly unless there are some unavoidable technical issues.

Feedback

Sponsor

Ministry of Education, through its National Mission on Education through Information and Communication Technology (NMEICT), has sponsored and funded the National Digital Library of India (NDLI) project.

Contact National Digital Library of India
Central Library (ISO-9001:2015 Certified)
Indian Institute of Technology Kharagpur
Kharagpur, West Bengal, India | PIN - 721302
See location in the Map
03222 282435
Mail: support@ndl.gov.in
Sl. Authority Responsibilities Communication Details
1 Ministry of Education (GoI),
Department of Higher Education
Sanctioning Authority https://www.education.gov.in/ict-initiatives
2 Indian Institute of Technology Kharagpur Host Institute of the Project: The host institute of the project is responsible for providing infrastructure support and hosting the project https://www.iitkgp.ac.in
3 National Digital Library of India Office, Indian Institute of Technology Kharagpur The administrative and infrastructural headquarters of the project Dr. B. Sutradhar  bsutra@ndl.gov.in
4 Project PI / Joint PI Principal Investigator and Joint Principal Investigators of the project Dr. B. Sutradhar  bsutra@ndl.gov.in
Prof. Saswat Chakrabarti  will be added soon
5 Website/Portal (Helpdesk) Queries regarding NDLI and its services support@ndl.gov.in
6 Contents and Copyright Issues Queries related to content curation and copyright issues content@ndl.gov.in
7 National Digital Library of India Club (NDLI Club) Queries related to NDLI Club formation, support, user awareness program, seminar/symposium, collaboration, social media, promotion, and outreach clubsupport@ndl.gov.in
8 Digital Preservation Centre (DPC) Assistance with digitizing and archiving copyright-free printed books dpc@ndl.gov.in
9 IDR Setup or Support Queries related to establishment and support of Institutional Digital Repository (IDR) and IDR workshops idr@ndl.gov.in
I will try my best to help you...
Cite this Content
Loading...