Jump to content

HTML sanitization

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 187.72.61.4 (talk) at 16:19, 5 December 2011. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

HTML sanitization is the process of examining an HTML document and producing a new HTML document that preserves only whatever tags are designated "safe". HTML sanitization can be used to protect against cross-site scripting and SQL injection attacks by sanitizing any HTML code submitted by a user.

Tags often allowed are <b>, <i>, <u>, <em>, and <strong>.

In PHP this can be performed using the strip_tags() function.[1]

In Java this can be achieved by using OWASP Java HTML Sanitizer Project [2]

References