Jump to content

Comparison of HTML parsers

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 2.137.116.117 (talk) at 16:43, 13 December 2012 (Undid revision 525004659 by AlexPenson (talk) Beautiful Soup references are already available on its Wikipedia page). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Parser License Implementation language(s) Latest release date
Beautiful Soup Python Software Foundation License Python 2012-08-20
html5lib MIT License Python and PHP 2012-02-11[1]
HTML::Parser Perl license Perl 2011-10-11
HTML Tidy W3C license ANSI C 2009-03-25[2]
HtmlCleaner BSD License[3] Java 2010-12-22[4]
Jericho HTML Parser Eclipse Public License Java 2012-10-30[5]
jsdom MIT license JavaScript 2012-07-12
jsoup MIT license Java 2012-09-23[6]
JTidy JTidy License Java 2009-12-01[7]
libxml2 HTMLparser MIT License C (programming language) 2012-09-11[8]
NekoHTML Apache License 2.0 Java 2012-11-05[9]
TagSoup Apache License 2.0 Java 2011-07-07
Validator.nu HTML Parser MIT License Java 2012-06-05
Parser License Implementation language(s) Latest release date

References