diff options
author | Willy Sudiarto Raharjo <willysr@slackbuilds.org> | 2023-09-21 22:38:24 +0700 |
---|---|---|
committer | Willy Sudiarto Raharjo <willysr@slackbuilds.org> | 2023-09-23 08:42:56 +0700 |
commit | 13c83954e83e6039d2c57423714b5ae1a63c0a32 (patch) | |
tree | eca9dbc0450e4096321b3c52d4f306d9fbb19af6 /python/python2-pdfminer/README | |
parent | e1e000c0f38e58e0f22323ee3cf3dd521aa01b36 (diff) |
python/python2-pdfminer: Removed (use python3-pdfminer.six).
Signed-off-by: Willy Sudiarto Raharjo <willysr@slackbuilds.org>
Diffstat (limited to 'python/python2-pdfminer/README')
-rw-r--r-- | python/python2-pdfminer/README | 23 |
1 files changed, 0 insertions, 23 deletions
diff --git a/python/python2-pdfminer/README b/python/python2-pdfminer/README deleted file mode 100644 index 64ca2affa2ffd..0000000000000 --- a/python/python2-pdfminer/README +++ /dev/null @@ -1,23 +0,0 @@ -PDFMiner is a tool for extracting information from PDF documents. Unlike -other PDF-related tools, it focuses entirely on getting and analyzing -text data. PDFMiner allows one to obtain the exact location of text in a -page, as well as other information such as fonts or lines. It includes a -PDF converter that can transform PDF files into other text formats (such -as HTML). It has an extensible PDF parser that can be used for other -purposes than text analysis. - -PDFMiner comes with two handy tools: pdf2txt.py and dumppdf.py. - -pdf2txt.py - -pdf2txt.py extracts text contents from a PDF file. It cannot recognize -text drawn as images. It also extracts locations, font names/sizes, -writing direction. It requires a password for password protected PDF -documents. You cannot extract any text from a PDF document which does -not have extraction permission. - -dumppdf.py - -dumppdf.py dumps the internal contents of a PDF file in pseudo-XML -format. This program is primarily for debugging purposes, but it's also -possible to extract some meaningful contents (e.g. images). |