Current Chinese named-entity recognition(NER)models have achieved remarkable results on public datasets.However,some studies suggest that they rely too heavily on literal features of entity text.Moreover,the influence of context on entity recognition has yet to be fully explored.Existing models perform poorly in simple invariance tests.To address this problem,this paper proposes explicitly modeling the context independently,enabling the model to differentiate between contextual information and the literal information of entities.Additionally,an adapted data en-hancement method is introduced to train the context,surface name,and combination modules.Experimental results show that this approach significantly improves the model's performance in the invariance test without sacrificing recognition performance,reducing the failure rate by 2.3%compared with the benchmark model.
关键词
自然语言处理/中文命名实体识别(NER)/上下文独立建模/数据增强
Key words
natural language processing/Chinese named-entity recognition(NER)/independent context modeling/data augmentation