[ubuntu-za] PDF converter
frans
dormakorp at vodamail.co.za
Fri Mar 30 15:01:20 UTC 2012
OK, I did something similar in one of two ways:
1. Viewed the PDF in continuous mode, highlighted and copied everything
to an Excel sheet, then sorted it and removed the unnecessary info,
after which I copied it to a text editor to get rid of the tables and then
to a word processor.
2. Used a text recognition program to convert it to text files. There
usually would be some errors;
it all depends on the data and how it's formatted.
PDF to text did not do it for me that time, but you might have more success.
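
If you want to give the PDF-to-text route another go on a 12000-page file, a rough sketch of doing it in batches is below. It assumes "pdf to text" means the pdftotext tool from poppler-utils, that it is installed, and that the PDF has a real text layer (scanned pages would still need text recognition). The function names, chunk size and output file names are made up for illustration.

```python
import subprocess


def page_ranges(total_pages, chunk_size):
    """Yield (first, last) 1-based inclusive page ranges covering the document."""
    for first in range(1, total_pages + 1, chunk_size):
        yield first, min(first + chunk_size - 1, total_pages)


def pdftotext_command(pdf_path, first, last, out_path):
    """Build the pdftotext call for one page range.

    -f / -l select the first and last page; -layout tries to keep
    columns aligned, which helps when the data is tabular.
    """
    return ["pdftotext", "-layout", "-f", str(first), "-l", str(last),
            pdf_path, out_path]


def convert_in_chunks(pdf_path, total_pages, chunk_size=500):
    """Run pdftotext once per chunk and return the text files written."""
    outputs = []
    for first, last in page_ranges(total_pages, chunk_size):
        # Hypothetical naming scheme: one text file per page range.
        out_path = "pages_%05d-%05d.txt" % (first, last)
        subprocess.run(pdftotext_command(pdf_path, first, last, out_path),
                       check=True)
        outputs.append(out_path)
    return outputs
```

Calling convert_in_chunks("big.pdf", 12000) (file name hypothetical) would leave 24 smaller text files, which are easier to check for errors and to paste into a spreadsheet than one huge file.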
On 12/03/30 13:58, William Walter Kinghorn wrote:
> Hi Rodemire,
>
> You can try gImageReader : http://sourceforge.net/projects/gimagereader/
>
> Look here for review : http://www.webupd8.org/2011/01/extract-text-from-pdfs-and-images-with.html
>
> William
>
>
> ________________________________________
> From: ubuntu-za-bounces at lists.ubuntu.com [ubuntu-za-bounces at lists.ubuntu.com] On Behalf Of Rodemire Tarazone [rodemire.tarazone at mail.com]
> Sent: 30 March 2012 11:51
> To: ubuntu-za at lists.ubuntu.com
> Subject: [ubuntu-za] PDF converter
>
> Good day everybody,
>
> I have a problem. I need to extract text data from a huge pdf document. The document is about 12000 pages long. I need to use a proper converter to get the data into either a spreadsheet or a Libreoffice Writer document (or MS Word). Is there a tool or application that can do this for me? I use Linux Mint 10 and Linux Mint 12.
>
> I tried importing using Libreoffice but it gets opened by Draw, which won't export the text for me.
>
> Thanks in advance for any help,
>
> Regards,
>
> Rodemire
>
> "This e-mail is subject to our Disclaimer, to view click http://www.dut.ac.za/pages/22414"
>
--
When working on a computer you have to know enough:
To fake what you don't know.
Google what you can't fake.
How (and when) to 'motivate' the computer to do what Google won't tell.
Frans de Waal
IT Manager/
Dormakorp