HTML解析器对比
外观
此條目目前正依照其他维基百科上的内容进行翻译。 (2014年5月) |
解析HTML是一项自动化工作,由(所谓的)HTML解析器执行。它们有两个主要目的:
- * Latest release (of significant changes) date.
- ** sanitize (generating standard-compatible web-page, reduce spam, etc.) and clean (strip out surplus presentational tags, remove XSS code, etc.) HTML code.
- *** Updates HTML4.X to XHTML or to HTML5, converting deprecated tags (ex. CENTER) to valid ones (ex. DIV with style="text-align:center;").
References
- ^ 12.2 Parsing HTML documents — HTML Standard
- ^ http://www.crummy.com/software/BeautifulSoup/
- ^ Releases · html5lib/html5lib-python
- ^ Bug #53300 for HTML-Parser: HTML 5
- ^ HTML Tidy for Windows
- ^ HTML Tidy for Windows
- ^ Tidy parser example: class.tidynode of PHP
- ^ HtmlCleaner is distributed under BSD License
- ^ Jericho HTML Parser - Browse /jericho-html/3.3 at SourceForge.net
- ^ jsoup/CHANGES at master · jhy/jsoup · GitHub
- ^ JTidy - Browse /JTidy at SourceForge.net
- ^ libxml2 Releases
- ^ NekoHTML | Change History