首页|Convolution Neural Network with Active Learning for Information Extraction of Enterprise Announcements
Convolution Neural Network with Active Learning for Information Extraction of Enterprise Announcements
扫码查看
点击上方二维码区域,可以放大扫码查看
原文链接
NETL
Springer Nature
We propose using convolution neural network (CNN) with active learning for information extraction of enterprise announcements。 The training process of supervised deep learning model usually requires a large amount of training data with high-quality reference samples。 Human production of such samples is tedious, and since inter-labeler agreement is low, very unreliable。 Active learning helps assuage this problem by automatically selecting a small amount of unlabeled samples for humans to hand correct。 Active learning chooses a selective set of samples to be labeled。 Then the CNN is trained on the labeled data iteratively, until the expected experimental effect is achieved。 We propose three sample selection methods based on certainty criterion。 We also establish an enterprise announcements dataset for experiments, which contains 10410 samples totally。 Our experiment results show that the amount of labeled data needed for a given extraction accuracy can be reduced by more than 45。79% compared to that without active learning。
Text classificationActive learningConvolutional neural networksEnterprise announcements
Lei Fu、Zhaoxia Yin、Yi Liu、Jun Zhang
展开 >
Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Anhui University, Hefei 230601, People's Republic of China,PKU Shenzhen Institute, Shenzhen, China
Key Laboratory of Intelligent Computing and Signal Processing, Ministry of Education, Anhui University, Hefei 230601, People's Republic of China
PKU Shenzhen Institute, Shenzhen, China
Shenzhen Securities Information, Co., Ltd., Shenzhen, China
展开 >
CCF international conference on natural language processing and Chinese computing