HUBEI AGRICULTURAL SCIENCES ›› 2026, Vol. 65 ›› Issue (5): 172-178.doi: 10.14088/j.cnki.issn0439-8114.2026.05.027

• Agricultural Engineering • Previous Articles     Next Articles

Construction and application of tobacco quality similarity algorithm based on near-infrared spectroscopy

WANG Da-bin1, ZHOU Xian-sheng2, LIU Zhi-guang2, PENG Fu-yu2, YU Wei-song1, QIU Jun1   

  1. 1. Tobacco Research Institute of Chinese Academy of Agricultural Sciences/Laboratory of Quality & Safety Risk Assessment for Tobacco, Ministry of Agriculture and Rural Affairs, Qingdao 266101, Shandong, China;
    2. China Tobacco Shandong Industrial Co., Ltd., Jinan 250000, China
  • Received:2026-03-09 Online:2026-05-25 Published:2026-05-26

Abstract: To achieve the digital and quantitative evaluation of tobacco quality similarity, a similarity measurement algorithm based on near-infrared(NIR) spectroscopy was constructed. Based on the NIR spectral and sensory evaluation data of single-grade tobacco vertical formulation samples from 2022 and 2023, dimensionality reduction was conducted using principal component analysis (PCA). Subsequently, the cosine distance and Euclidean distance were effectively coupled by introducing a kernel function and the L2 norm to calculate and construct the similarity metric matrices and sensory score difference matrices among the samples, followed by a correlation analysis between the two. The results demonstrated that the similarity metric values of 10 and 9 samples in the two respective years exhibited a significant positive correlation (P<0.05) with their sensory score differences, accounting for 66.7% and 60.0% of the total samples in each year, respectively. This indicated that the metric value derived from the model maintained a good consistency with expert sensory evaluations; namely, a larger algorithmic metric value corresponded to a greater sensory difference and a lower similarity between samples. This study provided an effective novel technical approach and quantitative tool for tobacco raw material substitution, auxiliary design of tobacco blends, and stability evaluation of cigarette product quality.

Key words: near-infrared spectroscopy, tobacco quality, similarity metric, kernel function, L2norm, digital evaluation

CLC Number: