Jump to content

Pdf-parser

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Shalajack (talk | contribs) at 01:32, 16 December 2011 (References). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
pdf-parser
Original author(s)Didier Stevens
Initial releaseMay 2, 2008 (2008-05-02)
Stable release
0.3.8 / April 4, 2010; 15 years ago (2010-04-04)
Written inPython programming language
Operating systemMultiplatform, including smart phones
TypePDF software
LicensePublic domain
Websitepdf-parser

pdf-parser is a command-line program that parses and analyses PDF documents. It provides features to extract raw data from PDF documents, like compressed images. pdf-parser can deal with malicious PDF documents that use obfuscation features of the PDF language[1].

The tool can also be used to extract data from damaged or corrupt PDF documents.

References

  1. ^ PDF Babushka by Bojan Zdrnja, Internet Storm Center, January 14, 2010

PDF editor for mac