Man Linux: Main Page and Category List

NAME

       hocr2pdf - hOCR to PDF converter of the ExactImage toolkit

SYNOPSIS

       hocr2pdf [option...] {-i | --input} input-file
                {-o | --output} output-file

       hocr2pdf {-h | --help}

DESCRIPTION

       ExactImage is a fast C++ image processing library. Unlike many other
       library frameworks it allows operation in several color spaces and bit
       depths natively, resulting in low memory and computational
       requirements.

       hocr2pdf creates well layouted, searchable PDF files from hOCR
       (annotated HTML) input obtained from an OCR system.

OPTIONS

       -i file, --input file
           Read image from the specified file. Note that input hOCR is read
           from the standard input.

       -o file, --output file
           Save output PDF to the specified file.

       -n, --no-image
           Don't place the image over the text. By default the text layer is
           hidden behind the image.

       -s, --sloppy-text
           Sloppily place text, group words, do not draw single glyphs.

       -r n, --resolution n
           Override resolution of the input image to n dpi. The default
           resolution (if not specified in the input file) is 300 dpi.

       -h, --help
           Display help text and exit.

EXAMPLE

           $ hocr2pdf -i scan.tiff -o test.pdf < cuneiform-out.hocr

SEE ALSO

       exactimage(7)

AUTHORS

       Jakub Wilk <jwilk@debian.org>
           Wrote this manual page for the Debian system.

       http://www.exactcode.de/site/open_source/exactimage/
           This manual page incorporates texts found on the ExactImage
           homepage.

COPYRIGHT

       This manual page was written for the Debian system (and may be used by
       others).

       Permission is granted to copy, distribute and/or modify this document
       under the terms of the GNU General Public License, Version 2 or (at
       your option) any later version published by the Free Software
       Foundation.

       On Debian systems, the complete text of the GNU General Public License
       can be found in /usr/share/common-licenses/GPL-2.