Identification of Boletus Species Based on Discriminant Analysis of Partial Least Squares and Random Forest Algorithm
Document type: Foreign-language journal
Authors: Chen Feng-xia 1; Yang Tian-wei 2; Li Jie-qing 1; Liu Hong-gao 3; Fan Mao-pan 1; Wang Yuan-zhong 4
Affiliations: 1. Yunnan Agr Univ, Coll Resources & Environm Sci, Kunming 650201, Yunnan, Peoples R China
2. Yunnan Inst Trop Crops Res, Jinghong 666100, Peoples R China
3. Yunnan Agr Univ, Coll Agron & Biotechnol, Kunming 650201, Yunnan, Peoples R China
4. Yunnan Acad Agr Sci, Inst Med Plants, Kunming 650200, Yunnan, Peoples R China
Keywords: Boletus; Mid-infrared spectroscopy; Ultraviolet spectroscopy; Discriminant analysis by partial least squares; Random forest; Data fusion
Journal: SPECTROSCOPY AND SPECTRAL ANALYSIS (Impact factor: 0.609; 5-year impact factor: 0.516)
ISSN: 1000-0593
Year, volume, issue: 2022, Vol. 42, No. 2
Pages:
Indexed in: SCI
Abstract: Boletus is a well-known wild edible mushroom with considerable culinary and economic value. There are many boletus species, and they are not easy to tell apart. An effective, rapid and reliable species identification technique can help improve the quality of boletus. In this study, 683 samples of 7 wild bolete species were collected from different regions of Yunnan, their infrared and ultraviolet spectra were acquired, and the average spectral characteristics of the different species were analyzed. Single-spectrum data were processed with several preprocessing combinations (SNV + SG, 2D + MSC + SNV, 1D + MSC + SNV + SG, MSC + 2D) and two feature extraction methods (principal component analysis, PCA; latent variables, LVs), and partial least squares discriminant analysis (PLS-DA) and the random forest algorithm were then combined with data fusion strategies to identify the boletus species. The approach has a certain degree of novelty. The results show: (1) The average mid-infrared and ultraviolet spectra of the different boletus species show only small differences in their absorption peaks and subtle differences in absorbance. (2) Appropriate preprocessing can enhance the information in the spectral data. The best preprocessing combinations for the mid-infrared and ultraviolet spectral data under the PLS-DA and random forest models are 2D + MSC + SNV, SNV + SG, 2D + MSC + SNV and 1D + MSC + SNV + SG. (3) Among the single-spectrum models, the mid-infrared models outperform the ultraviolet models. The PLS-DA model built on mid-infrared spectra with the best preprocessing combination, 2D + MSC + SNV, has an accuracy of 99.78% on the training set and 99.12% on the validation set. The random forest model has an accuracy of 93.20% on the training set and 99% on the validation set. (4) The data fusion strategy improves classification accuracy. The low-level fusion PLS-DA model has an accuracy of 100% on the training set and 99.12% on the validation set; the random forest model reaches 92.32% and 99.14%. (5) With the random forest algorithm, intermediate data fusion based on latent variables (LVs) gives 92.76% on the training set and 96% on the validation set, and intermediate data fusion based on principal component analysis (PCA) gives 97.15% and 100%. (6) With PLS-DA, intermediate data fusion (LVs) gives 100% on the training set and 99.56% on the validation set, and intermediate data fusion (PCA) reaches 100% on both the training and validation sets. PLS-DA and the random forest algorithm combined with data fusion strategies identify boletus species satisfactorily. PLS-DA with intermediate data fusion (PCA) can serve as a low-cost, high-efficiency technique for identifying boletus species.
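The preprocessing and fusion steps named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the spectra are synthetic, the function names (`snv`, `msc`, `pca_scores`, `low_level_fusion`, `intermediate_fusion`) are hypothetical, the derivative and Savitzky-Golay (SG) steps are omitted, and the number of principal components is an arbitrary assumption. It only shows how low-level fusion concatenates preprocessed spectra, while intermediate fusion concatenates features (here PCA scores) extracted from each spectral block.

```python
import numpy as np

def snv(X):
    """Standard normal variate: centre and scale each spectrum (row)."""
    mean = X.mean(axis=1, keepdims=True)
    std = X.std(axis=1, ddof=1, keepdims=True)
    return (X - mean) / std

def msc(X):
    """Multiplicative scatter correction against the mean spectrum."""
    ref = X.mean(axis=0)
    corrected = np.empty_like(X, dtype=float)
    for i, row in enumerate(X):
        # Fit row ~ slope * ref + intercept, then undo both effects.
        slope, intercept = np.polyfit(ref, row, 1)
        corrected[i] = (row - intercept) / slope
    return corrected

def pca_scores(X, n_components):
    """Scores of mean-centred data on its first principal components."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T

def low_level_fusion(blocks):
    """Concatenate the (preprocessed) spectra of each sample."""
    return np.hstack(blocks)

def intermediate_fusion(blocks, n_components):
    """Concatenate PCA scores extracted separately from each block."""
    return np.hstack([pca_scores(b, n_components) for b in blocks])

# Synthetic stand-ins for mid-infrared and ultraviolet spectra (assumption):
# a common band shape with per-sample scaling, offset, and noise.
rng = np.random.default_rng(0)
n_samples = 20
scale = rng.uniform(0.8, 1.2, size=(n_samples, 1))
offset = rng.uniform(-0.1, 0.1, size=(n_samples, 1))
mir_base = np.sin(np.linspace(0, 3 * np.pi, 300)) + 2.0
uv_base = np.exp(-((np.linspace(0, 1, 80) - 0.4) / 0.1) ** 2) + 0.5
mir = scale * mir_base + offset + rng.normal(0, 0.01, (n_samples, 300))
uv = scale * uv_base + offset + rng.normal(0, 0.01, (n_samples, 80))

mir_p = snv(msc(mir))   # MSC followed by SNV
uv_p = snv(uv)          # SNV only (SG smoothing omitted here)

low = low_level_fusion([mir_p, uv_p])
mid = intermediate_fusion([mir_p, uv_p], n_components=5)
print(low.shape, mid.shape)
```

In a full workflow, the fused matrices `low` and `mid` would be split into training and validation sets and passed to a classifier such as PLS-DA or a random forest; that step is left out here to keep the sketch dependent on NumPy only.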
- Related literature
Other papers by the authors
-
Data Fusion of ATR-FTIR and UV-Vis Spectra to Identify the Origin of Polygonatum Kingianum
Authors: Zhang Jiao; Wang Yuan-zhong; Yang Wei-ze; Zhang Jin-yu
Keywords: Polygonatum kingianum; Origin identification; Data fusion; ATR-FTIR; UV-Vis
-
Traceability of Boletus Edulis Origin by Multispectral Analysis Combined With Mineral Elements From Different Parts
Authors: Chen Feng-xia; Li Jie-qing; Fan Mao-pan; Yang Tian-wei; Liu Hong-gao; Wang Yuan-zhong
Keywords: Boletus edulis; Multi-spectral analysis; Mineral; Identification of producing areas
-
The Origin Identification Study of Boletus Edulis Based on the Infrared Spectrum Data Fusion Strategy
Authors: Hu Yi-ran; Li Jie-qing; Fan Mao-pan; Liu Hong-gao; Wang Yuan-zhong
Keywords: Boletus edulis; Geographic origin identification; Data fusion; Fourier transform mid-infrared spectrum; Fourier transform near infrared spectrum
-
Infrared Spectral Study on the Origin Identification of Boletus Tomentipes Based on the Random Forest Algorithm and Data Fusion Strategy
Authors: Hu Yi-ran; Li Jie-qing; Fan Mao-pan; Liu Hong-gao; Wang Yuan-zhong
Keywords: Boletus tomentipes; Geographic origin identification; Data fusion; Fourier transform mid-infrared spectrum; Fourier transform near infrared spectrum
-
Study on Differentiation of Swertia leducii and Its Closely Relative Species Based on Data Fusion of Spectra and Chromatography
Authors: Yu Ye-xia; Li Li; Wang Yuan-zhong
Keywords: Data fusion; Species differentiation; Swertia leducii; Closely relative species; Fourier transform infrared spectroscopy; Ultra-performance liquid chromatography
-
Study of the Underground Parts Identification and Saponins Content Prediction of Panax Notoginseng Based on FTIR Combined with Chemometrics
Authors: Li Yun; Zhang Ji; Jin Hang; Wang Yuan-zhong; Zhang Jin-yu
Keywords: Fourier transform infrared spectroscopy; Panax notoginseng; Main root; Rhizome; Fibrous root; Powder identification; Saponins content prediction
-
Application of 17 Classification Algorithms for Authentication Research of Various Boletus
Authors: Zhang Yu; Li Jie-qing; Liu Hong-gao; Wang Yuan-zhong; Li Tao
Keywords: Boletaceae; FTIR; Species identification; Different parts; Data fusion