机构地区: 上海交通大学电子信息与电气工程学院电子工程系
出 处: 《计算机工程》 2003年第11期95-97,共3页
摘 要: 提出了一种实现对中文网页进行自动分类的平衡差值法,它利用本体中主题概念 的层次结构和主题词?特征项的各种语义关系,降低了分类算法的复杂性和计算量?试验表 明,该方法可以获得85%以上的网页分类准确率? With the explosion of the information on Internet, automatically cla ssifying Web pages is becoming an important problem that information retrieval a nd information search have to be faced. This paper proposes a balance difference algorithm, which uses the semantic relations between topic words, feature items and utilizes the hierarchical structure of the ontology concepts to reduce the complexity and computation of the classification of Chinese Web pages. Experimen ts have proved that this method can get 85% precision at least.
领 域: [自动化与计算机技术] [自动化与计算机技术]