Chinese passive sentences can be classified into marked and unmarked passive sentences based on the pres-ence of passive markers.Due to their complex and diverse forms,they pose significant challenges to natural language understanding.Therefore,the automatic recognition of Chinese passive sentences is important for downstream tasks in natural language processing.In this paper,we construct a corpus specifically for passive sentences and propose a PC-BERT-CNN model that integrates part-of-speech and verb argument frame information to automatic Chinese passive sentence identification.Experiment results demonstrate the proposed model achieves 98.77%F1 score for marked passive sentence recognition,and 96.72%for unmarked passive sentence recognition.
关键词
汉语被动句/自动识别/特征融合/语料库
Key words
Chinese passive sentences/automatic recognition/feature fusion/corpus