In Linux flavours How to pdf2txt

pdftotext is installed using poppler-utils package under various Linux distributions:
# yum install poppler-utils
OR use the following under Debian / Ubuntu Linux
$ sudo apt-get install poppler-utils

pdftotext syntax

pdftotext {PDF-file} {text-file}

How do I convert a pdf to text?
Convert a pdf file called hp-manual.pdf to hp-manual.txt, enter:
$ pdftotext hp-manual.pdf hp-manual.txt
Specifies the first page to convert, enter:
$ pdftotext -f 5 hp-manual.pdf hp-manual.txt
Specifies the last page to convert, enter:
$ pdftotext -l 5 hp-manual.pdf hp-manual.txt
Convert a pdf file protected and encrypted by owner password:
$ pdftotext -opw 'password' hp-manual.pdf hp-manual.txt
Convert a pdf file protected and encrypted by user password:
$ pdftotext -upw 'password' hp-manual.pdf hp-manual.txt
Sets the end-of-line convention to use for text output. You can set it to unix, dos or mac. For UNIX / Linux oses, enter:
$ pdftotext -eol unix hp-manual.pdf hp-manual.txt

1 comment:

  1. Hello:
    In the Downloads folder is the arch MVF01.. and I write pdftotext MVF01de06.pdf mvf01.txt and the result is a file wlth the text file name but it is empty. Could you help me, what is wrong with it. thank you in advance manuel

    ReplyDelete