Lastly, there are still issues with the present
CGEC technology, such as how to balance the
minimal edit distance principle's requirement for
sentence fluency while maintaining error correction
accuracy, and how to better optimise the CGEC
model for increased efficiency and accuracy.
challenges. At the same time, because the evaluation
criteria for CGEC tasks are not always consistent,
especially when it comes to sentence structure
adjustments, the establishment of evaluation criteria
is also a difficult problem.
4 CONCLUSION
This article addresses the open issues in the field of
Chinese text error correction and presents the most
recent advances in models and techniques. Generally
speaking, Chinese text error correction technology is
developing rapidly in the two directions of spelling
error correction and grammatical error correction.
New models with better performance are constantly
being produced. Today's CSC models can already
integrate audio and visual information for error
correction, and can consider semantic information to
a certain extent. The great variety and
unpredictability of Chinese, along with the scarcity of
high-quality datasets, continue to pose hurdles to the
performance of today's CGEC models on select
public datasets.
Subsequent studies pertaining to Chinese text
error correction will probably continue to concentrate
on the two methodologies of CSC and CGEC.
Researchers will continue to conduct further research
and improvements on the subtle understanding,
adaptability and generalization capabilities of the
CSC model and CGEC model. In addition, for CGEC
technology, it is quite necessary to have better
training data sets in the future. In addition, there are
relatively few error correction techniques for Chinese
semantics, and allocating funds for this method's
study could encourage the advancement of Chinese
text error correcting methods.
REFERENCES
X. Cheng, W. Xu, K. Chen, et al. Spellgcn: Incorporating
phonological and visual similarities into language
models for chinese spelling check. arXiv preprint
arXiv:2004.14166, (2020).
B. Wang, W. Che, D.Wu, et al. Dynamic connected
networks for Chinese spelling check, Findings of the
Association for Computational Linguistics : 2437-2446
(2021).
C. Li C, C. Zhang, Zheng, et al. Exploration and
exploitation: Two ways to improve Chinese spelling
correction models. arXiv preprint arXiv:2105.14813,
(2021).
S. Zhang, H.Huang, J. Liu, et al. Spelling error correction
with soft-masked BERT. arXiv preprint
arXiv:2005.07421, (2020).
S. Liu S, T. Yang, T. Yue , et al. PLOME: Pre-training with
misspelled knowledge for Chinese spelling correction,
Inter. Joint Conf. Natur. Lang.Proc. 2991-3000 (2021).
L. Huang, J. Li , W. Jiang, et al. PHMOSpell: Phonological
and morphological knowledge guided Chinese spelling
check, Inter. Joint Conf. Natur. Lang.Proc. 5958-5967,
(2021).
L. Liu, H. Wu, H. Zhao. Chinese Spelling Correction as
Rephrasing Language Model. arXiv preprint
arXiv:2308.08796, (2023).
L. Sheng, Z. Xu, X. Li, et al., EDMSpell: Incorporating the
error discriminator mechanism into Chinese spelling
correction for the overcorrection problem. J. King Saud
University-Comput. Inf. Sci., 35, art. no. 101573
(2023).
Z. Yuan, T. Briscoe, Grammatical error correction using
neural machine translation. Proc. Conf. North Am.
Chapter Assoc. Comput. Linguistics: Hum. Lang.
Technol., 380-386 (2016).
H. Kuang, K. Wu, X. Ma, et al., A Chinese Grammatical
Error Correction Method Based on Iterative Training
and Sequence Tagging. Appl. Sci., 12, art. no. 4364
(2022).
Z. Zhao, H. Wang, Maskgec: Improving neural
grammatical error correction via dynamic masking.
Proc. AAAI Conf. Artif. Intell., 34, 1226-1233 (2020).
J. Ye, Y. Li, S. Ma, et al., Focus is what you need for
Chinese grammatical error correction. CoRR,
abs/2210.12692 (2022).
B. Sun, B. Wang, Y. Wang, et al., CSED: A Chinese
Semantic Error Diagnosis Corpus. arXiv preprint
arXiv:2305.05183 (2023).
F. Wang, Z. Xie, An Adversarial Multi-task Learning
Method for Chinese Text Correction with Semantic
Detection. Int. Conf. Artif. Neural Networks, 159-173
(2022).
S. Li, J. Zhao, G. Shi, et al., Chinese grammatical error
correction based on convolutional sequence to
sequence model. IEEE Access, 7, 72905-72913 (2019).