[ubuntu-za] PDF converter
Rodemire Tarazone
rodemire.tarazone at mail.com
Fri Mar 30 20:14:01 UTC 2012
Thank you guys for your quick responses, it turns out pdftotext does the conversion pretty well. It sent the 12000 paeg data to text in less than 30secs which was really impressive. I'll try the other apps as well,
Regards,
Rodemire
----- Original Message -----
From: frans
Sent: 03/30/12 05:01 PM
To: ubuntu-za at lists.ubuntu.com
Subject: Re: [ubuntu-za] PDF converter
Ok I did similar on one of two ways. 1 viewed as continuous highlighted and copied all to an excel sheet, then sort and removed unnecessary info after which I copied it to a text editor to get rid of tables and then to a word processor 2 used a text recognition program to convert it to text files, there usually would be some errors, all dependents on the data and how it's formatted pdf to text did not do it for me, that time but you might have more success On 12/03/30 13:58, William Walter Kinghorn wrote: > Hi Rodemire, > > You can try gImageReader : http://sourceforge.net/projects/gimagereader/ > > Look here for review : http://www.webupd8.org/2011/01/extract-text-from-pdfs-and-images-with.html > > William > > > ________________________________________ > From: ubuntu-za-bounces at lists.ubuntu.com [ubuntu-za-bounces at lists.ubuntu.com] On Behalf Of Rodemire Tarazone [rodemire.tarazone at mail.com] > Sent: 30 March 2012 11:51 > To: ubuntu-za at lists.ubuntu.com > Subject: [ubuntu-za] PDF converter > > Good day everybody, > > I have a problem. I need to extract text data from a huge pdf document. The document is about 12000 pages long. I need to use a proper converter to get the data into either a spreadsheet or a Libreoffice Writer document (or MS Word). Is there a tool or application that can do this for me? I use Linux Mint 10 and Linux Mint 12. > > I tried importing using Libreoffice but it gets opened by Draw which wont export teh text for me. > > Thanks in advance for any help, > > Regards, > > Rodemire > > "This e-mail is subject to our Disclaimer, to view click http://www.dut.ac.za/pages/22414" > -- When working on a computer you have to know enough: To fake what you don't know. Google what you can't fake. How (and when) to 'motivate' the computer to do what Google won't tell. Frans de Waal IT Manager/ Dormakorp -- ubuntu-za mailing list ubuntu-za at lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-za
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.ubuntu.com/archives/ubuntu-za/attachments/20120330/fb77a4d2/attachment.html>
More information about the ubuntu-za
mailing list