帮助 本站公告
您现在所在的位置:网站首页 > 知识中心 > 文献详情
文献详细Journal detailed

SPXY样本划分法及蒙特卡罗交叉验证结合近红外光谱用于橘叶中橙皮苷的含量测定
Determination of Hesperidin in Tangerine Leaf by Near-Infrared Spectroscopy with SPXY Algorithm for Sample Subset Partitioning and Monte Carlo Cross Validation

作  者: ; ; ; ; ;

机构地区: 北京中医药大学中药学院

出  处: 《光谱学与光谱分析》 2009年第4期964-968,共5页

摘  要: 在近红外光谱PLS定量模型的建立过程中训练集样本的选取和潜变量数的确定是十分重要的。因此,该研究以橘叶中橙皮苷的含量检测为例,分别比较了random sampling(RS),Kennard-Stone(KS),duplex,sample set partitioning based on joint x-y distance(SPXY)四种训练集样本的选取方法对模型的影响,以及留一交互验证法和蒙特卡罗法对潜变量数确定的影响。结果表明,SPXY法选取的训练集建立的模型优于其他三种方法,蒙特卡罗法能够较好地确定模型的潜变量数并有效地减少过拟合风险,所建模型的交互验证均方根,预测均方根及预测集相关系数分别为0.7681,0.7369,0.9752。 It is very crucial that a representative training set can be extracted from a pool of real samples. Moreover, it is difficult to determine the adapted number of latent variables in PLS regression. For comparison, PLS models were constructed by SPXY, as well as by using the random sampling, duplex and Kennard-Stone methods for selecting a representative subset during the measurement of tangerine leaf. In order to choose correctly the dimension of calibration model, two methods were applied, one of which is leave-one-out cross validation and the other is Monte Carlo cross validation. The results present that the correlation co- efficient of the predicted model is 0. 996 9, RMSECV is 0. 768 1, and RMSEP is 0. 736 9, which reveal that SPXY is superior to the other three strategies, and Monte Carlo cross validation can successfully avoid an unnecessary large model, and as a result decreases the risk of over-fitting for the calibration model.

关 键 词: 近红外光谱法 训练集选择 潜变量数 蒙特卡罗法

领  域: [理学] [理学]

相关作者

作者 王增文
作者 程海洲

相关机构对象

机构 华南农业大学公共管理学院
机构 华南理工大学
机构 暨南大学

相关领域作者

作者 刘广平
作者 彭刚
作者 杨科
作者 陈艺云
作者 崔淑慧