您好,欢迎访问云南省农业科学院 机构知识库!

ATR-FTIR Spectroscopy Preprocessing Technique Selection for Identification of Geographical Origins of Gastrodia elata Blume

文献类型: 外文期刊

作者: Liu, Hong 1 ; Liu, Honggao 3 ; Li, Jieqing 1 ; Wang, Yuanzhong 2 ;

作者机构: 1.Yunnan Agr Univ, Coll Agron & Biotechnol, Kunming, Peoples R China

2.Yunnan Acad Agr Sci, Med Plants Res Inst, Kunming, Peoples R China

3.Zhaotong Univ, Yunnan Key Lab Gastrodia & Fungi Symbiot Biol, Zhaotong, Yunnan, Peoples R China

关键词: ATR-FTIR spectroscopy; data preprocessing; DD-SIMCA; Gastrodia elata Blume; GBM; PLS-DA; SVM

期刊名称:JOURNAL OF CHEMOMETRICS ( 影响因子:1.9; 五年影响因子:2.2 )

ISSN: 0886-9383

年卷期: 2024 年

页码:

收录情况: SCI

摘要: Gastrodia elata Blume from different regions varies in growth conditions, soil types, and climate, which directly affects the content and quality of its medicinal components. Accurately identifying the origin can effectively ensure the medicinal value of G. elata Bl., prevent the circulation of counterfeit products, and thus protect the interests and health of consumers. Attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy is a rapid and effective method for verifying the authenticity of traditional Chinese medicines. However, the presence of scattering effects in the spectra poses challenges in establishing reliable discrimination models. Therefore, employing appropriate scattering correction techniques is crucial for improving the quality of spectral data and the accuracy of discrimination models. This study uses two ensemble preprocessing approaches; the first type is series fusion of scatter correction technologies (SCSF), and another method is sequential preprocessing through orthogonalization (SPORT). Four discriminant models were established using a single scattering correction technique and two ensemble preprocessing approaches. The results show that the data-driven version of the soft independent modeling of class analogy (DD-SIMCA) model built based on multiplicative scatter correction (MSC) preprocessing has a sensitivity of 0.98 and a specificity of 0.91, able to effectively distinguish whether a sample of G. elata Bl. originates from Zhaotong. In addition, three discriminant models including support vector machine (SVM), partial least squares discriminant analysis (PLS-DA), and three gradient boosting machine (GBM) algorithms built using the ensemble preprocessing approach have good classification and generalization capabilities. Among them, the SCSF-PLS-DA model has the best performance with 99.68% and 98.08% accuracy for the training and test sets, respectively, and F1 of 0.97; the SPORT-SVM model achieved the second-best classification ability. The results show that the ensemble preprocessing approach used can improve the success rate of G. elata Bl. geographical origin classification.

  • 相关文献
作者其他论文 更多>>