Convolution Neural Network with Active Learning for Information Extraction of Enterprise Announcements

We propose using a convolutional neural network (CNN) with active learning for information extraction from enterprise announcements. Training a supervised deep learning model usually requires a large amount of training data with high-quality reference samples. Producing such samples by hand is tedious and, because inter-labeler agreement is low, very unreliable. Active learning helps alleviate this problem by automatically selecting a small, selective set of unlabeled samples for humans to correct by hand. The CNN is then trained iteratively on the labeled data until the expected experimental performance is reached. We propose three sample selection methods based on a certainty criterion. We also establish an enterprise announcements dataset for experiments, which contains 10,410 samples in total. Our experimental results show that the amount of labeled data needed for a given extraction accuracy can be reduced by more than 45.79% compared with training without active learning.
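
The abstract does not say which three certainty-based selection methods are used. As a rough illustration only, the sketch below shows three common uncertainty measures from the active learning literature (least confidence, margin, and entropy) applied to a classifier's per-class probabilities. These three measures, the function names, and the toy probabilities are assumptions made for this example and are not taken from the paper.

```python
import math
from typing import Callable, List, Sequence

Probs = Sequence[float]  # assumed: softmax output of the classifier for one sample


def least_confidence(p: Probs) -> float:
    """1 minus the top class probability; higher means less certain."""
    return 1.0 - max(p)


def margin_score(p: Probs) -> float:
    """Negated gap between the two most likely classes; higher means less certain."""
    top = sorted(p, reverse=True)
    return -(top[0] - top[1])


def entropy_score(p: Probs) -> float:
    """Shannon entropy of the predicted distribution; higher means less certain."""
    return -sum(q * math.log(q) for q in p if q > 0.0)


def select_for_labeling(
    pool_probs: Sequence[Probs],
    k: int,
    score: Callable[[Probs], float] = entropy_score,
) -> List[int]:
    """Return the indices of the k least certain samples in the unlabeled pool."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: score(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Toy pool of three unlabeled samples with made-up class probabilities.
pool = [[0.98, 0.01, 0.01], [0.40, 0.35, 0.25], [0.70, 0.20, 0.10]]
print(select_for_labeling(pool, k=2))  # → [1, 2]
```

In an iterative setup like the one described above, a routine of this kind would be called once per round: the selected samples go to human annotators, are added to the labeled set, and the model is retrained before the next selection.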

Text classification, Active learning, Convolutional neural networks, Enterprise announcements

Lei Fu, Zhaoxia Yin, Yi Liu, Jun Zhang

Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Anhui University, Hefei 230601, People's Republic of China; PKU Shenzhen Institute, Shenzhen, China

Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Anhui University, Hefei 230601, People's Republic of China

PKU Shenzhen Institute, Shenzhen, China

Shenzhen Securities Information, Co., Ltd., Shenzhen, China

CCF International Conference on Natural Language Processing and Chinese Computing

Hohhot(CN)

Natural language processing and Chinese computing

330-339

2018