[ubuntu-za] PDF converter

frans dormakorp at vodamail.co.za
Fri Mar 30 15:01:20 UTC 2012


OK, I have done something similar in one of two ways.

1 Viewed the PDF in continuous mode, highlighted and copied everything
     to an Excel sheet, then sorted it and removed the unnecessary info,
     after which I copied it to a text editor to get rid of the tables
     and then to a word processor.

2 Used a text recognition (OCR) program to convert it to text files.
     There will usually be some errors; it all depends on the data
     and how it's formatted.

pdf to text did not do it for me that time, but you might have more success.
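
If you do end up with plain text (from copy and paste or from OCR), the
sort-and-remove step from option 1 can be scripted rather than done by
hand in a spreadsheet. Below is only a rough Python sketch, not what I
actually used: the function names clean_lines and lines_to_csv, the
"Page " marker and the sample rows are all made up for illustration, and
it assumes the columns in your text are separated by two or more spaces.

```python
import csv
import io
import re


def clean_lines(raw_text, unwanted=()):
    """Return the non-empty lines of raw_text, skipping lines that contain an unwanted phrase."""
    kept = []
    for line in raw_text.splitlines():
        line = line.strip()
        if not line:
            continue  # drop blank lines
        if any(phrase in line for phrase in unwanted):
            continue  # drop page headers/footers and similar noise
        kept.append(line)
    return kept


def lines_to_csv(lines):
    """Split each line into columns on runs of two or more whitespace characters and return CSV text."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for line in lines:
        writer.writerow(re.split(r"\s{2,}", line))
    return out.getvalue()


if __name__ == "__main__":
    # Made-up sample standing in for text copied out of the PDF.
    sample = "Page 1 of 12000\n\nSmith  J  Durban\nNaidoo  P  Cape Town\n"
    rows = clean_lines(sample, unwanted=("Page ",))
    print(lines_to_csv(sorted(rows)), end="")
```

The CSV text it produces can be saved to a file and opened in a
spreadsheet for checking. You would need to adjust the unwanted phrases
and the column splitting to match how your own document is laid out.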



On 12/03/30 13:58, William Walter Kinghorn wrote:
> Hi Rodemire,
>
> You can try gImageReader : http://sourceforge.net/projects/gimagereader/
>
> Look here for review : http://www.webupd8.org/2011/01/extract-text-from-pdfs-and-images-with.html
>
> William
>
>
> ________________________________________
> From: ubuntu-za-bounces at lists.ubuntu.com [ubuntu-za-bounces at lists.ubuntu.com] On Behalf Of Rodemire Tarazone [rodemire.tarazone at mail.com]
> Sent: 30 March 2012 11:51
> To: ubuntu-za at lists.ubuntu.com
> Subject: [ubuntu-za] PDF converter
>
> Good day everybody,
>
> I have a problem. I need to extract text data from a huge pdf document. The document is about 12000 pages long. I need to use a proper converter to get the data into either a spreadsheet or a Libreoffice Writer document (or MS Word). Is there a tool or application that can do this for me? I use Linux Mint 10 and Linux Mint 12.
>
> I tried importing using LibreOffice but it gets opened by Draw, which won't export the text for me.
>
> Thanks in advance for any help,
>
> Regards,
>
> Rodemire
>
> "This e-mail is subject to our Disclaimer, to view click http://www.dut.ac.za/pages/22414"
>

-- 
  When working on a computer you have to know enough:
  To fake what you don't know.
  Google what you can't fake.
  How (and when) to 'motivate' the computer to do what Google won't tell.

Frans de Waal
IT Manager/
Dormakorp



