湖北农业科学 ›› 2025, Vol. 64 ›› Issue (2): 179-183.doi: 10.14088/j.cnki.issn0439-8114.2025.02.028

• 信息工程 • 上一篇    下一篇

基于CNN-LSTM模型的土壤温湿度缺失数据填补算法

张瑛进1, 史志强1, 古丽米拉·克孜尔别克1, 库木斯·阿依肯2   

  1. 1.新疆农业大学计算机与信息工程学院,乌鲁木齐 830052;
    2.新疆医科大学第七附属医院,乌鲁木齐 830001
  • 收稿日期:2024-07-09 出版日期:2025-02-25 发布日期:2025-03-07
  • 通讯作者: 古丽米拉·克孜尔别克(1970-),女(哈萨克族),新疆昌吉人,副教授,主要从事农业信息化研究,(电话)13899939189(电子信箱)glml@xjau.edu.cn。
  • 作者简介:张瑛进(1999-),女,甘肃武威人,在读硕士研究生,研究方向为农业信息化,(电话)17393828254(电子信箱)2916497264@qq.com。
  • 基金资助:
    科技部科技创新2030重大项目(2022ZD0115800); 新疆维吾尔自治区重大科技专项(2022A02011-4)

Missing data filling algorithm for soil temperature and humidity based on CNN-LSTM Model

ZHANG Ying-jin1, SHI Zhi-qiang1, Gulimila Kezierbieke1, Kumusi Ayiken2   

  1. 1. Computer and Information Engineering College,Xinjiang Agricultural University, Urumqi 830052,China;
    2. The Seventh Affiliated Hospital,Xinjiang Medical University, Urumqi 830001,China
  • Received:2024-07-09 Published:2025-02-25 Online:2025-03-07

摘要: 针对因恶劣环境、电池耗尽、硬件故障等原因导致的土壤温湿度传感器数据丢失问题,提出一种基于卷积神经网络的长短期记忆网络(CNN-LSTM)填补模型。以闪电河流域2019年土壤温湿度数据为试验数据,分别选用CNN、LSTM、TCN、CNN-TCN、CNN-LSTM 5个模型对土壤温湿度传感器网络缺失数据进行填补,并采用Adam算法优化模型,使用决定系数(R2)、均方根误差(RMSE)、平均绝对误差(MAE)指数对模型填补结果进行评价。结果表明,采用线性插补算法获得完整的数据,CNN-LSTM模型的R2为0.999 9,高于其他模型,MAERMSE分别为0.001 85、0.019 70,均远低于其他模型。采用k近邻插补算法获得完整的数据,CNN-LSTM模型的MAERMSE分别为0.000 12、0.000 12,均远低于其他模型,R2为0.999 4,高于CNN模型、TCN模型;CNN-LSTM模型对土壤温湿度传感器数据缺失值的填补效果最好。CNN-LSTM模型在处理大规模土壤温湿度传感器缺失数据问题时具有较好的可行性和精确度。

关键词: CNN-LSTM模型, 土壤, 温湿度, 缺失数据填补算法

Abstract: A convolutional neural network-based long short-term memory network (CNN-LSTM) filling model was proposed to address the problem of soil temperature and humidity sensor data loss caused by harsh environments, battery depletion, hardware failures, and other factors. Using the soil temperature and humidity data from the Shandian River Basin in 2019 as experimental data, five models including CNN, LSTM, TCN, CNN-TCN, and CNN-LSTM were selected to fill in the missing data of the soil temperature and humidity sensor network. The Adam algorithm was used to optimize the model, and the coefficient of determination (R2), mean square root error (RMSE), and mean absolute error (MAE) index were used to evaluate the results of the model filling. The results showed that using the linear interpolation algorithm to obtain complete data, the R2 of the CNN-LSTM model was 0.999 9, which was higher than that of other models. The MAE and RMSE were 0.001 85 and 0.019 70, respectively, which were much lower than those of other models. The K-nearest neighbor interpolation algorithm was used to obtain complete data. The MAE and RMSE of the CNN-LSTM model were 0.000 12 and 0.000 12, respectively, which were much lower than those of other models. The R2 was 0.999 4, which was higher than that of the CNN model, and TCN model;the CNN-LSTM model had the best filling effect on missing values in soil temperature and humidity sensor data. The CNN-LSTM model had good feasibility and accuracy in dealing with the problem of missing data from large-scale soil temperature and humidity sensors.

Key words: CNN-LSTM model, soil, temperature and humidity, missing data filling algorithm

中图分类号: