[texhax] extracting a plain text file of the final document

Philip TAYLOR chaa006 at gmail.com
Sun Jan 15 13:58:52 CET 2012


Confirmation of provenance :

> F:\TeX\Live\2010\bin\win32>.\pdftotext
> pdftotext version 3.02pl4
> Copyright 1996-2007 Glyph & Cog, LLC
> Usage: pdftotext [options] <PDF-file> [<text-file>]
>   -f <int>          : first page to convert
>   -l <int>          : last page to convert
>   -layout           : maintain original physical layout
>   -raw              : keep strings in content stream order
>   -htmlmeta         : generate a simple HTML file, including the meta information
>   -enc <string>     : output text encoding name
>   -eol <string>     : output end-of-line convention (unix, dos, or mac)
>   -nopgbrk          : don't insert page breaks between pages
>   -opw <string>     : owner password (for encrypted files)
>   -upw <string>     : user password (for encrypted files)
>   -q                : don't print any messages or errors
>   -cfg <string>     : configuration file to use in place of .xpdfrc
>   -v                : print copyright and version info
>   -h                : print usage information
>   -help             : print usage information
>   --help            : print usage information
>   -?                : print usage information
>
> F:\TeX\Live\2010\bin\win32>
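
For anyone wanting to script the extraction, here is a minimal sketch (not part of the original session) of how the options listed above might be combined. It only assembles the argument list; the file names, the helper names and the "UTF-8" encoding name are assumptions for illustration, and the flags used are just those shown in the usage message.

```python
import subprocess


def build_pdftotext_command(pdf_path, txt_path, first_page=None, last_page=None,
                            layout=False, encoding=None, eol=None,
                            page_breaks=True, quiet=False):
    """Assemble a pdftotext argument list (hypothetical helper).

    Only flags that appear in the usage listing above are emitted.
    """
    cmd = ["pdftotext"]
    if first_page is not None:
        cmd += ["-f", str(first_page)]   # first page to convert
    if last_page is not None:
        cmd += ["-l", str(last_page)]    # last page to convert
    if layout:
        cmd.append("-layout")            # maintain original physical layout
    if encoding is not None:
        cmd += ["-enc", encoding]        # output text encoding name
    if eol is not None:
        cmd += ["-eol", eol]             # unix, dos, or mac
    if not page_breaks:
        cmd.append("-nopgbrk")           # no page breaks between pages
    if quiet:
        cmd.append("-q")                 # suppress messages and errors
    cmd += [pdf_path, txt_path]
    return cmd


def run_pdftotext(cmd):
    """Run the assembled command; assumes pdftotext is on the PATH."""
    subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "final.pdf" and "final.txt" are placeholder names.
    print(" ".join(build_pdftotext_command(
        "final.pdf", "final.txt", layout=True, encoding="UTF-8",
        eol="unix", page_breaks=False)))
```

Calling run_pdftotext on the returned list would perform the conversion; whether -layout or -raw gives the more useful text depends on the document, so it is worth trying both.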

