HUBEI AGRICULTURAL SCIENCES ›› 2022, Vol. 61 ›› Issue (16): 186-192.doi: 10.14088/j.cnki.issn0439-8114.2022.16.036

• Information Engineering • Previous Articles     Next Articles

Research on question generation technology in agricultural field based on NEZHA-UniLM model

LI Fen, FEI Fan, PENG Lin   

  1. College of Big Data,Yunnan Agricultural University, Kunming 650000, China
  • Received:2022-06-17 Online:2022-08-25 Published:2022-09-14

Abstract: To address the lack of question and answer datasets in the agricultural domain and the fact that most end-to-end models are currently used for question generation tasks, after a series of data crawling, cleaning, filtering and annotation, a question generation dataset in agricultural domain was constructed; and the question generation in agricultural domain based on NEZHA-UniLM pre-training model was studied, for the cumulative error phenomenon caused by exposure error, adversarial training to generate perturbed samples to alleviate the problem was introduced. Compared with other benchmark models, the BLEU-4 and Rouge-L of the NEZHA-UniLM model reached 0.383 0 and 0.583 9. Compared with the pre-trained model without adversarial training, the BLEU-4 and Rouge-L were improved by 0.068 9 and 0.113 8, respectively. BLEU_4 and Rouge-L were improved by 0.195 3 and 0.151 7, respectively. The experimental results showed that the model not only effectively alleviated the problems of low matching between generated questions and answers, missing or multiple words in generated questions and exposure errors, but also effectively improved the quality of generated questions.

Key words: natural language processing, NEZHA-UniLM pre-training model, adversarial training, problem generation

CLC Number: