keep the commandline and saving as plain text
please, to simply create workflows it would be great to stick with the unix command-line. Maybe more switches?
-o <output> # -o - or ommited switch -> then goes to stdout
-f pdf,txt,rtf # format the written output as pdf, as txt, as rtf
-q # autoscale the input
-r # replace original
-n "fmt" # name the destination according to fmt starting with dirname of the argument (./../../destdir/myscans/yyyy-mm-dd)
split tokens to variables: -w 1,5 # take first 5 words for the name
split tokens to variables: -l 3 # take 3 lines for the name
-p # create path for destination
-s|-S <value> # summarize by value, -s by sentence, -S by paragraph
-v|-V # verbose/noVerbose logging to console
-
Oliver commented
I would be very happy if I just could create searchable PDFs in the different Tesseract languages, never mind the GUI. Suggestion:
--lang <lang> #Tesseract language/script code (deu, fra, etc.)
-
I think it's got to the stage where the command-line is essential to the testing of the product - so it's likely to stay.
Most of the options that you want can be replicated in script invoking velocraptor.rb, so you'll forgive me if I pass on them.