NLP-KG
Semantic Search

Publication:

Chinese Spelling Check based on Neural Machine Translation

Chiao-Wen LiJhih-Jie ChenJason J. S. Chang • @Pacific Asia Conference on Language, Information and Computation • 01 January 2018

TLDR: A character-based neural machine translation (NMT) model is trained to translate a potentially misspelled sentence into correct one, using right-and-wrong sentence pairs from newspaper edit logs and artificially generated data.

Citations: 7
Abstract: This paper presents a method for Chinese spelling check that automatically learns to correct a sentence with potential spelling errors. In our approach, a character-based neural machine translation (NMT) model is trained to translate a potentially misspelled sentence into correct one, using right-and-wrong sentence pairs from newspaper edit logs and artificially generated data. The method involves extracting sentences containing edits of spelling correction from edit logs, using commonly confused right-and-wrong word pairs to generate artificial right-and-wrong sentence pairs in order to expand our training data, and training the NMT model. The evaluation on the United Daily News (UDN) Edit Logs and SIGHAN-7 Shared Task shows that adding artificial error data significantly improves the performance of Chinese spelling check system.

Related Fields of Study

loading

Citations

Sort by
Previous
Next

Showing results 1 to 0 of 0

Previous
Next

References

Sort by
Previous
Next

Showing results 1 to 0 of 0

Previous
Next