机构地区: 中山大学物理科学与工程技术学院
出 处: 《电脑与信息技术》 2008年第2期18-21,共4页
摘 要: 针对通用数据库海量数据检索速度慢的缺点,文章提出了一个数据检索优化系统。该系统通过将海量数据拆分成短语和单词,利用哈稀算法和基数排序算法,将拆分的短语和单词重新组织成词典,并对每个短语和单词建立倒排表,利用该倒排表对通用数据库中的海量数据做索引。使用这种基于倒排表的数据索引能够将数据检索速度降低到毫秒级。 In order to reduce the disadvantage of the slow data search of common database, this article introduces a system to optimize data search. The system divides large numbers of data into terms and words. These terms and words are rebuilt to dictionary by using Hash and Radix sorting algorithm. Then the system builds inverted files based every term and word in the dictionary. Finally, the system uses the inverted files to build sort lists for the large numbers of data in the common database. Using the sort lists based on the inverted files, the speed of data search can reduce to millisecond level.
领 域: [自动化与计算机技术] [自动化与计算机技术]