Jump to content

Pdf-parser

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by PDFAnalyst (talk | contribs) at 19:21, 3 April 2012 (New version). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
pdf-parser
Original author(s)Didier Stevens
Initial releaseMay 2, 2008 (2008-05-02)
Stable release
0.3.9 / March 11, 2012; 13 years ago (2012-03-11)
Written inPython programming language
Operating systemMultiplatform, including smart phones
TypePDF software
LicensePublic domain
Websitepdf-parser

pdf-parser is a command-line program that parses and analyses PDF documents. It provides features to extract raw data from PDF documents, like compressed images. pdf-parser can deal with malicious PDF documents that use obfuscation features of the PDF language[1].

The tool can also be used to extract data from damaged or corrupt PDF documents.

References

  1. ^ PDF Babushka by Bojan Zdrnja, Internet Storm Center, January 14, 2010