Let me save as raw text
The PDF is all very pretty, but I want to get at the raw text. Let me save as text rather than PDF.
22
votes
AdminDuncan McGregor
(Admin, VelOCRaptor)
shared this idea
-
Anonymous commented
I know of only two free tools that do a decent job of text extraction from pdf (considering layout complexities). They are pdftotext (there are several by that name i mean the one coming with xpdf) and multivalent. There needs to be two modes (which at least pdftotext supports): one that attempts to preserve layout even in plain text, and the other that just gets out raw strings (this is easier).