Thread: Tesseract OCR on 12.04?

  1. #1

    Tesseract OCR on 12.04?

    Hi,
    Can anyone explain how to use Tesseract OCR?
    It has no GUI as far as I can see (OCRFeeder does not work).
    I read that tesseract is a great program, but I have no clue how to run it.
    As far as I can tell, it is installed on my Ubuntu system.

    Any suggestions for a complete beginner would be helpful.

    Thanks

  2. #2

    Re: Tesseract OCR on 12.04?

    As far as I know, you have to use Tesseract from within another program. I use gscan2pdf, which is in the repos. In gscan2pdf's OCR settings, just set it to use Tesseract.

  3. #3

    Re: Tesseract OCR on 12.04?

    You can use tesseract as a stand-alone program.

    The basic command for tesseract is:

    Code:
    tesseract inputfilename outputfile
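    The input file can be in a number of formats; I've used .png and uncompressed .tif. The output is a plain-text file: tesseract appends .txt to the output name you give, so "outputfile" becomes outputfile.txt.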
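    To make the basic command concrete, here is a minimal sketch of a shell helper wrapped around it. The function name ocr_page is made up for illustration and is not part of tesseract. The only tesseract behaviour it relies on is the two-argument form shown above and the fact that tesseract adds .txt to the output name. It assumes the image file name has an extension.

```shell
#!/bin/sh
# ocr_page is a hypothetical helper name, not part of tesseract.
# Usage: ocr_page IMAGE   (IMAGE is assumed to have an extension)
# Runs "tesseract IMAGE BASE", where BASE is IMAGE without its extension.
# tesseract itself appends ".txt", so scan.png ends up as scan.txt.
ocr_page() {
    input=$1
    if [ ! -f "$input" ]; then
        echo "ocr_page: no such file: $input" >&2
        return 1
    fi
    base=${input%.*}            # strip the last extension: scan.png -> scan
    tesseract "$input" "$base" || return 1
    printf '%s\n' "$base.txt"   # report where the text was written
}
```

    After loading the function into a shell, calling ocr_page scan.png should leave the recognised text in scan.txt, which can then be opened in any text editor.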

    Traditionally, tesseract didn't handle multi-column originals and other fancy layouts. You may be able to do that with some of the options which are set out in the man page (man tesseract).
    Last edited by Irihapeti; June 10th, 2012 at 11:18 PM. Reason: Added info

  4. #4

    Re: Tesseract OCR on 12.04?

    Useful community documentation here:

    https://help.ubuntu.com/community/OCR#Tesseract

    One quirk is that input TIFs must have a .tif extension, not .tiff.
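    If the .tif quirk affects your version (it may only apply to older releases), one way around it is to copy the file to a .tif name before running tesseract. The sketch below is only an illustration: tif_name and ensure_tif are made-up helper names, and it only handles a lowercase .tiff extension.

```shell
#!/bin/sh
# tif_name and ensure_tif are hypothetical helper names for illustration.

# Print NAME with a trailing ".tiff" rewritten to ".tif"; other names pass through.
tif_name() {
    case $1 in
        *.tiff) printf '%s\n' "${1%.tiff}.tif" ;;
        *)      printf '%s\n' "$1" ;;
    esac
}

# Copy FILE to its ".tif" name if needed and print the name to hand to tesseract.
# The original file is left untouched.
ensure_tif() {
    target=$(tif_name "$1")
    if [ "$target" != "$1" ]; then
        cp -- "$1" "$target" || return 1
    fi
    printf '%s\n' "$target"
}
```

    With these loaded, tesseract "$(ensure_tif page.tiff)" page would run OCR on the page.tif copy and write page.txt.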

  5. #5

    Re: Tesseract OCR on 12.04?

    Linux-intelligent-ocr-solution

    http://code.google.com/p/linux-intel...-ocr-solution/

    Lios is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as a PDF, an image, or a folder containing images. The program is fully accessible to visually impaired users. Lios is written in Python, and we release it under the GPL3 license. Lios will work with Debian-based operating systems. There are a great many possibilities for this program, and feedback is the key to it.
