Results 1 to 5 of 5

Thread: Tesseract OCR on 12.04?

  1. #1
    Join Date
    Jan 2010
    Beans
    10

    Angry Tesseract OCR on 12.04?

    Hi,
    Can any one explain how to work with the Tesseract OCR ?
    It has no GUI as far as I can see,[OCRFeed does not work].
    I read that tesseract is great program but have no clue how to run it?
    As far as I can tell it is installed on my ubuntu.

    Any suggestion for a complete beginner will be helpful.

    Thanks

  2. #2
    Join Date
    May 2005
    Location
    Indiana
    Beans
    1,935
    Distro
    Hardy Heron (Ubuntu Development)

    Re: Tesseract OCR on 12.04?

    As far as I know, you have to use Tesseract from within another program. I use gscan2pdf, which is in the repos. In gscan2pdf's OCR settings, just set it to use Tesseract.
    Today you are You, that is truer than true. There is no one alive who is Youer than You. - Dr. Seuss

  3. #3
    Join Date
    Jul 2007
    Location
    Tāmaki Makau-rau, NZ
    Beans
    5,616
    Distro
    Xubuntu 14.04 Trusty Tahr

    Re: Tesseract OCR on 12.04?

    You can use tesseract as a stand-alone program.

    The basic command for tesseract is:

    Code:
    tesseract inputfilename outputfile
    Input file can be in a number of formats - I've used .png and .tif (uncompressed). Output file will be text.

    Traditionally, tesseract didn't handle multi-column originals and other fancy layouts. You may be able to do that with some of the options which are set out in the man page (man tesseract).
    Last edited by Irihapeti; June 10th, 2012 at 11:18 PM. Reason: Added info
    Please, people, remember to BACKUP before you install that new system. Same if you're upgrading.

  4. #4
    Join Date
    Jun 2006
    Location
    UK
    Beans
    Hidden!
    Distro
    Ubuntu 15.10 Wily Werewolf

    Re: Tesseract OCR on 12.04?

    Useful community documentation here:

    https://help.ubuntu.com/community/OCR#Tesseract

    One quirk is that input TIFs must have a .tif extension, not .tiff.
    Ubuntu 15.10 Desktop Guide - Ubuntu 14.04 Desktop Guide - Forum Guide to BBCode - IRC #ubuntuforums

    Member: Not Canonical Team

    Please do not PM me about your forum account unless you have been asked to. The correct place to contact an admin about your account is here.

  5. #5
    Join Date
    Sep 2012
    Beans
    7

    Re: Tesseract OCR on 12.04?

    Linux-intelligent-ocr-solution

    http://code.google.com/p/linux-intel...-ocr-solution/

    Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image or Folder containing Images. Program is given total accessibility for visually impaired. Lios is written in python, and we release it under GPL3 license. Lios will work with Debian based operating systems. There are great many possibilities for this program, Feedback is the key to it

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •