机构地区: 扬州大学信息工程学院
出 处: 《计算机应用》 2010年第2期423-426,448,共5页
摘 要: 提出一种基于最小闭树特征集的聚类与分类方法,有效地解决了在实际应用中因数据量大而无法聚类与分类的问题。其基本思想为:以最小闭树特征集作为候选聚类与分类特征,采用动态阈值按相似度聚类,使得树聚类快速而精确;提出树分类规则等级概念,并应用于树分类方法中,能迅速预测未知的树结构。实验结果表明,在树节点数较多或数据量大时,新方法有效可行,且与类其他方法相比效率有显著提高。 A tree clustering and classification algorithm was proposed based on least closed tree, which effectively solved the problem that the clustering and classification can not be completed when data amount is very large in practical application. Least closed tree was regarded as the candidate cluster and classification features. The dynamic threshold was used for similarity cluster to make tree cluster operation rapid and accurate. Meanwhile the concept of tree classification rule grade was proposed and applied in tree classification algorithm, so that the unknown tree structure could be predicted promptly. Experimental results show that the method has higher speed and efficiency than that of other similar ones especially with large number of tree nodes.
领 域: [自动化与计算机技术] [自动化与计算机技术]