Skip to content
Sections
>> Trisquel >> Packages >> etiona >> text >> ocrodjvu
etiona  ]
[ Source: ocrodjvu  ]

Package: ocrodjvu (0.10.2-1)

tool to perform OCR on DjVu documents

Ocrodjvu is a wrapper around the Optical Character Recognition (OCR) systems Cuneiform, Gocr, Ocrad, OCRopus and (standalone) Tesseract. It is designed for OCR on documents in DjVu format, which is especially suited for high-quality archiving of books.

After processing, the DjVu document embeds a text layer. Other programs can then be used to read the document, search it for specific terms, print it out, or use the information in the OCR layer as a way to improve the document's accessibility.

Other Packages Related to ocrodjvu

  • depends
  • recommends
  • suggests
  • rec: python-html5lib
    HTML parser/tokenizer based on the WHATWG HTML5 specification
  • rec: python-lxml
    pythonic binding for the libxml2 and libxslt libraries
  • rec: python-pyicu (>= 1.0~)
    Python extension wrapping the ICU C++ API
  • rec: python-subprocess32
    backport of the Py3 stdlib subprocess module for Py2
  • rec: tesseract-ocr
    Tesseract command line OCR tool
  • sug: cuneiform
    Package not available
  • sug: gocr
    Command line OCR
  • sug: ocrad
    optical character recognition program

Download ocrodjvu

Download for all available architectures
Architecture Package Size Installed Size Files
all 36.2 kB184 kB [list of files]