机构地区: 中山大学资讯管理学院资讯管理系
出 处: 《情报理论与实践》 2008年第4期594-597,共4页
摘 要: 目前网上存在大量揭载学术论文的网页,而这些学术文献尚未被加以开发、组织和保存。非主题特征指与文献的主题没有直接关联,即在标引以及检索时不以叙词或主题性关键词表述的特征。文体特征是文献的非主题特征之一,利用它对学术文献进行文体分析,提供了检索网络学术文献的新途径。本文在Google的基础上设计、开发了一个实验系统,并利用此系统检验使用文体特征检索网络学术文献的效果。实验表明,文体特征在一定程度上提高了查找网络学术文献的准确率。 At present, there is a large amount of Web pages containing academic contents in WWW, however, these academic literatures haven' t yet been explored, organized and preserved. Non-subject features usually refer to the features that are not directly related to the subject of the literature, in another word, they are not presented by descriptors or keywords in indexing and retrieving. Stylistic characteristics are one of the non-subject features of the literature, by utilizing them to analyze the academic literature stylistically, we obtain a new approach to online academic literature retrieval. This paper designs and develops an experimental system based on Google, and uses this system to evaluate the effect of using stylistic characteristics to retrieve the online academic literature. The experimental results show that stylistic characteristics improve the precision of online academic literature retrieval to some extent.