
Thread: to convert pdf to excel

  1. #1
    Join Date
    Nov 2008
    Beans
    224

    to convert pdf to excel

    I am using Ubuntu 10.04.
    I have a PDF statement and want to convert it to an Excel file format.
    Is there any OCR engine/software that can accomplish this task?
    Can someone please advise?

  2. #2
    Join Date
    Nov 2005
    Location
    Lincolnshire, UK
    Beans
    1,461
    Distro
    Ubuntu 10.04 Lucid Lynx

    Re: to convert pdf to excel

    wstay: One possibility is to open the pdf file with PDFedit then save the file as a text file.
    Code:
    sudo apt-get install pdfedit
    You can then import the text file into OpenOffice.org (or LibreOffice) Calc and save the resulting spreadsheet in the desired format (e.g. ods or xls).

    To import the text file into Calc, open the Text Import dialog box via the Insert, Sheet from File menu. Then choose either "Fixed width" or the "Separated by" options as appropriate for your data. Note that if you use the "Fixed width" option you can set each column width via the "Fields" preview of the box.
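    For anyone who prefers to script this step, here is a minimal sketch of what the "Fixed width" option does: each line is cut at chosen character positions and the pieces become columns. The sample statement lines, the column positions (12 and 32) and the function names are made up for illustration; a real statement will need its own positions.

```python
import csv
import io


def split_fixed_width(line, boundaries):
    """Cut one line at the given character positions and trim each field."""
    edges = [0] + list(boundaries) + [None]
    return [line[start:end].strip() for start, end in zip(edges, edges[1:])]


def fixed_width_to_csv(text, boundaries):
    """Convert fixed-width text to CSV text, skipping blank lines."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for line in text.splitlines():
        if line.strip():
            writer.writerow(split_fixed_width(line, boundaries))
    return out.getvalue()


# Hypothetical statement lines, not taken from any real file.
sample = (
    "01/06/2012  Opening balance      1000.00\n"
    "05/06/2012  Grocery store         -45.20\n"
)

print(fixed_width_to_csv(sample, [12, 32]))
```

    The resulting CSV text can be saved with a .csv extension and opened in Calc, where the "Separated by" option with a comma should then split it into columns.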

  3. #3
    Join Date
    May 2012
    Beans
    7

    Re: to convert pdf to excel

    I usually search for online sites that do the trick, but I will try the PDFedit method. It seems good to me.

    Thanks.

  4. #4
    Join Date
    Nov 2008
    Beans
    224

    Re: to convert pdf to excel

    Originally posted by Zill:
    wstay:
    To import the text file into Calc, open the Text Import dialog box via the Insert, Sheet from File menu. Then choose either "Fixed width" or the "Separated by" options as appropriate for your data. Note that if you use the "Fixed width" option you can set each column width via the "Fields" preview of the box.
    I cannot find the "Fixed width" or "Separated by" options in the
    Insert, Sheet from File menu. Please refer to the attachment.
    Where can I find these options?
    Please help.
    Attached Images

  5. #5
    Join Date
    Nov 2005
    Location
    Lincolnshire, UK
    Beans
    1,461
    Distro
    Ubuntu 10.04 Lucid Lynx

    Re: to convert pdf to excel

    wstay: It looks like you chose the "Insert" menu option "Sheet", rather than "Sheet from file". If you choose "Sheet from file" you should then get the option to select your text file and then select the delimiters as described in my earlier post.

    ps. You can still do this even by selecting the "Sheet" option. The screen shot you posted shows the radio button "From file" and if you select this it gives the same options as I described above.

  6. #6
    Join Date
    Nov 2008
    Beans
    224

    Re: to convert pdf to excel

    Originally posted by Zill:
    wstay: It looks like you chose the "Insert" menu option "Sheet", rather than "Sheet from file". If you choose "Sheet from file" you should then get the option to select your text file and then select the delimiters as described in my earlier post.

    ps. You can still do this even by selecting the "Sheet" option. The screen shot you posted shows the radio button "From file" and if you select this it gives the same options as I described above.
    Using Insert, "Sheet from file", I got the dialogs shown in Screenshot-1 and Screenshot-2.
    When I opened the selected text file ('statement' in Screenshot-2), which I saved from a PDF file with PDFedit, I got Screenshot-3.
    When I used the Browse button to select and open the 'statement' text file, I came back to Screenshot-3.
    There are no "Fixed width" or "Separated by" delimiter options like the ones you mentioned earlier.
    Attached Images

  7. #7
    Join Date
    Jun 2009
    Location
    Oregon
    Beans
    399

    Re: to convert pdf to excel

    I believe you need to copy the text from the PDF file and paste it into a blank spreadsheet to get the column options. I'm using a newer version of OpenOffice.org/LibreOffice, so YMMV.

    If the statement is an image rather than embedded text, you will need OCR software.

  8. #8
    Join Date
    Nov 2005
    Location
    Lincolnshire, UK
    Beans
    1,461
    Distro
    Ubuntu 10.04 Lucid Lynx

    Re: to convert pdf to excel

    wstay: I am not quite sure which version of OOo you are using but it looks different to mine! I use the standard Ubuntu 10.04 version so I guess you must be using a different build as the "Insert Sheet from File" appears on your "File" menu, rather than on the "Insert" menu as in my version. However, I guess the functionality remains the same and so I have attached some screendumps to show exactly how this works on my version.

    Firstly, I suggest you make sure you have saved your pdf file as a text file (using PDFedit). Although this shouldn't be necessary (this is Linux after all!) you may like to save your text file with a .txt extension. Then, using the "Nautilus" file manager, click on the text file and it should open in the text editor "Gedit". If it does not then the file has not been converted properly.
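    If you would rather check the file without opening it in Gedit, the following is a rough sketch of the same test. It assumes that a properly converted file is valid UTF-8 text with no NUL bytes and does not still begin with the "%PDF-" header that PDF files start with. The function name is made up for illustration.

```python
def looks_like_plain_text(data: bytes) -> bool:
    """Rough heuristic: True if the bytes look like text rather than a PDF or binary file."""
    # A file that still starts with the PDF header was never converted.
    if data.startswith(b"%PDF-"):
        return False
    # NUL bytes are a strong hint of binary content.
    if b"\x00" in data:
        return False
    # Assumption: the converted text is UTF-8 (plain ASCII also passes).
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


print(looks_like_plain_text(b"Date        Description      Amount\n"))
print(looks_like_plain_text(b"%PDF-1.4\n"))
```

    To run it on a real file, read the file in binary mode and pass the bytes to the function. A text file saved in another encoding may be rejected by this check even though Gedit can open it.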

    Then, using the OOo "Insert Sheet from File" menu, open your text file as shown in my attachments.

    This should then open the "Text Import" dialog box which will allow you to select the required delimiter. If you choose "Fixed width" then you can drag the column markers in the data preview area at the bottom of the box. Finally, click "OK" and your data should be on a spreadsheet, just awaiting final "tweaking" to tidy things up!
    Attached Images
    Last edited by Zill; June 25th, 2012 at 06:06 PM.
