机构地区: 河北大学数学与计算机学院
出 处: 《计算机工程》 2006年第23期202-204,228,共4页
摘 要: 印刷体数学公式识别是OCR技术的重要组成部分,也是识别技术发展的瓶颈所在。在介绍公式识别技术发展现状的基础上,针对结构分析这一公式识别的关键环节,提出了一种基于基准线和字符间空白域特征的公式二维结构分析方法,并将语义和语境分析策略融入其中。实验表明,这种方法对公式结构分析具有较好的鲁棒性和应用前景。 Mathematical expressions recognition is an important part of OCR technology. It is also a bottleneck in the development of recognition technology. To the structural analysis stage, which is a crucial course in printed formula recognition, this paper proposes a method which makes use of baseline and operator range with syntax analysis based on the introduction of the development state of mathematical expressions recognition. In experiments, this method shows robust adaptability for the structure of mathematical expressions, and will have a good foreground.
领 域: [自动化与计算机技术] [自动化与计算机技术]