Pdf-parser

pdf-parser
Original author(s)	Didier Stevens
Initial release	May 2, 2008
Stable release	0.3.9 / March 11, 2012; 13 years ago
Written in	Python programming language
Operating system	Multiplatform, including smart phones
Type	PDF software
License	Public domain
Website	pdf-parser

pdf-parser is a command-line program that parses and analyses PDF documents. It provides features to extract raw data from PDF documents, like compressed images. pdf-parser can deal with malicious PDF documents that use obfuscation features of the PDF language^[1].

The tool can also be used to extract data from damaged or corrupt PDF documents.

References

^ PDF Babushka by Bojan Zdrnja, Internet Storm Center, January 14, 2010

[1] PDF Babushka by Bojan Zdrnja, Internet Storm Center, January 14, 2010

[1]