文献详情 - Gdtheory理论粤军网|广东智库信息化平台

文献详细_{Journal detailed}

基于倒排索引位运算的深度优先频繁项集挖掘
Depth First Frequent Itemset Mining Based on Bittable and Inverted Index

下载全文在线阅读

收藏

作　　者： ; ; ;

出　　处： 《小型微型计算机系统》 2012年第8期1747-1751,共5页

摘　　要： 频繁项集挖掘是关联规则挖掘中的关键任务,非常耗费时间.为提高频繁项集的产生效率,提出一种基于倒排索引位运算的深度优先频繁项集挖掘算法(DF-FIMBII).该算法以二进制数组存储项目到事务的倒排索引,通过位运算计算两个项目的支持计数,并采用深度优先搜索策略递归地挖掘不同的k-频繁项集.在chess、mushroom、pumb_star、T40I10D100K等数据集上,对DF-FIMBII、Apriori、ECLAT、BitTableFI、Index-BitTableFI等算法进行了实验比较.实验结果表明,在数据规模不是非常巨大和支持度较小的情况下,无论数据集的稠密程度如何,DF-FIMBII均具有较好的时间优越性. Frequent itemset mining is a very important procedure in the association rule mining.A Depth First Frequent Itemset Mining Based on Bittable and Inverted Index（DF-FIMBII） is proposed in this paper.DF-FIMBII stores the inverted index of the vertical structure of items to transactions with bittables arrays,and calculates the support counts of two items through the bit operations.Furthermore,DF-FIMBII computes different k-frequent itemsets with a depth first search strategy.We compared the execution time of DF-FIMBII,Apriori,ECLAT,BitTableFI and Index-BitTableFI in four datasets such as chess,mushroom,pumb_star and T40I10D100K.The experiment results show that DF-FIMBII can improve the time consumption when the volume of dataset is not very large and the support value is relative small.

关键词： 频繁项集二进制数组倒排索引深度优先搜索

领　　域： [自动化与计算机技术] [自动化与计算机技术]

基于倒排索引位运算的深度优先频繁项集挖掘
Depth First Frequent Itemset Mining Based on Bittable and Inverted Index

参考文献更多+

二级参考文献更多+

引证文献更多+

二级引证文献更多+

同被引文献更多+

耦合作品文献更多+

相关文献更多+

相关作者

相关机构对象

相关领域作者

基于倒排索引位运算的深度优先频繁项集挖掘 Depth First Frequent Itemset Mining Based on Bittable and Inverted Index

参考文献 更多+

二级参考文献 更多+

引证文献 更多+

二级引证文献 更多+

同被引文献 更多+

耦合作品文献 更多+

相关文献 更多+

相关作者

相关机构对象

相关领域作者

基于倒排索引位运算的深度优先频繁项集挖掘
Depth First Frequent Itemset Mining Based on Bittable and Inverted Index

参考文献更多+

二级参考文献更多+

引证文献更多+

二级引证文献更多+

同被引文献更多+

耦合作品文献更多+

相关文献更多+