机构地区: 深圳大学计算机与软件学院
出 处: 《现代图书情报技术》 2011年第11期9-16,共8页
摘 要: 设计面向综合性中文叙词表本体的叙词概念定义抽取方法,获得良好的实验效果并已投入实际应用。其中,基于"高频词与句子向量"和"TF*IDF向量"两种定义抽取算法提出的二维相对量的融合算法,能够更有效地抽取出前两种方法的良好结果,有效信息提高比一般可达到60%。 The paper proposes some methods of definition extraction for concepts in the comprehensive OntoThesaurus. They achieve good experiment effects and are applied to the actual OTCSS. Among them, an integrated algorithm named "two - dimensional relative quantity" based on "high - frequency words vector" and "TF * IDF vector" is presented. This algorithm can much effectively extract good results from that of the first two methods, and the effective information impro- ving ratio can reach 60% generally.