首页|CG-ANER: Enhanced contextual embeddings and glyph features-based agricultural named entity recognition

CG-ANER: Enhanced contextual embeddings and glyph features-based agricultural named entity recognition

扫码查看
? 2022 Elsevier B.V.In recent years, deep learning has greatly improved the performance of named entity recognition models in various fields, especially in the agricultural domain. However, most existing works only utilize word embedding models to generate the context-independent embeddings, which is limited in modeling polysemous words. Moreover, the abundant morphological information in agricultural texts has not been fully utilized. Besides, the local context information needs to be further extracted. To solve the aforementioned issues, a novel enhanced contextual embeddings and glyph features-based model was proposed. First, the contextual embeddings were dynamically generated by the fine-tuned Bidirectional Encoder Representation from Transformers (BERT) on the domain-specific corpus (e.g., agricultural texts), and then the multi-granularity information was obtained from the layers of BERT. Thus, the contextual embeddings not only contain domain-specific knowledge but also include multi-grained semantic information. Second, a novel 3-dimension convolutional neural network-based framework was designed to capture the contextual glyph features for each character from the image perspective. Third, a channel-wised fusion architecture was also introduced to further improve the ability of the convolutional neural network layer to capture local context features. Experimental results showed that our proposed model achieved the best F1-scores of 95.02% and 96.51% on AgCNER and Resume datasets, which indicated the effectiveness and generalization of our model to identify the entities in cross-domain texts. The ablation study in many aspects also demonstrated the better performance of the proposed model.

3-dimension convolutional neural networkAgricultural named entity recognitionBi-directional long short-term memory networkFine-tuning language modelGlyph features

Guo X.、Tang Z.、Bai Z.、Diao L.、Zhou H.、Li L.、Lu S.

展开 >

College of Information and Electrical Engineering China Agricultural University

School of Information University of Michigan

2022

Computers and Electronics in Agriculture

Computers and Electronics in Agriculture

EISCI
ISSN:0168-1699
年,卷(期):2022.194
  • 4
  • 53