HUBEI AGRICULTURAL SCIENCES ›› 2022, Vol. 61 ›› Issue (18): 196-202.doi: 10.14088/j.cnki.issn0439-8114.2022.18.035

• Information Engineering • Previous Articles     Next Articles

Named entity recognition in agriculture based on BERT embedding and adversarial training

FEI Fan, YANG Lin-nan   

  1. College of Big Data, Yunnan Agricultural University,Kunming 650201, China
  • Received:2021-07-14 Online:2022-09-25 Published:2022-10-21

Abstract: Named entity recognition task for the agricultural domain is a key step in information extraction and question and answer system in the agricultural domain. The goal of this step is to find out the required named entities from a huge amount of unstructured agricultural texts, which usually have challenges such as diverse entity names and contextual semantics missing. To accomplish the task of recognizing agricultural named entities in complex contexts, this paper first constructs an annotated corpus in the agricultural domain, which contains 16 048 samples of six types of entities; then uses the BERT pre-trained language model as the word embedding layer, which can well solve the problem of different semantics and referents of the same word in different contexts compared with the traditional word embedding model; then uses the BiGRU network model for context encoding; finally, the output sequence is annotated using CRF. At the same time, this paper introduces a certain amount of noise to the input data, which is used for adversarial training to improve the generalization and robustness of the model. After the experiments, the accuracy, recall, and F-value of the proposed model are 92.75%, 91.53%, and 92.49%, respectively. Compared with the baseline model, this method has better performance and can effectively identify named entities in the agricultural field.

Key words: agriculture, natural language processing, named entity recognition, information extraction, BERT, BiGRU, adversarial training

CLC Number: