[ etiona ]
[ Source: ocr4gamera ]

Package: python-gamera.toolkits.ocr (1.2.2-5)

toolkit for building OCR systems

The Gamera OCR Toolkit is meant to help build optical character recognition (OCR) systems for standard text documents. Although it can be used as is, it is specifically designed so that the individual steps of the recognition system are customizable and replaceable. It provides:

 * a flexible mechanism for plugging in custom page segmentation algorithms
 * heuristic rules for dealing with diacritics, and for disambiguating
   commonly confused Roman characters (like comma and apostrophe, or
   lowercase and uppercase ‘W’)
 * a ready-to-run script, ocr4gamera, which acts as a basic OCR system.

Note that the toolkit does not include any training data.

Other packages related to python-gamera.toolkits.ocr

  • depends
  • recommends
  • suggests
  • dep: python
    interactive high-level object-oriented language (default version)
  • dep: python-gamera (>= 3.2.6)
    framework for building document analysis applications
  • sug: aspell
    GNU Aspell spell-checker
    or ispell
    International Ispell (an interactive spelling corrector)
  • enh: python-gamera
    framework for building document analysis applications

Download python-gamera.toolkits.ocr

Download for all available architectures
Architecture    Package Size    Installed Size    Files
all             100.7 kB        327 kB            [list of files]