HTML sanitization
Appearance
This article needs additional citations for verification. (December 2009) |
HTML sanitization is the process of examining an HTML document and producing a new HTML document that preserves only whatever tags are designated "safe". HTML sanitization can be used to protect against cross-site scripting attacks by sanitizing any HTML code submitted by a user.
Tags often allowed are <b>, <i>, <u>, <em>, and <strong>.
In PHP this can be performed using the strip_tags()
or htmlspecialchars()
functions.[1][2]
In Java this can be achieved by using OWASP Java HTML Sanitizer Project [3]
References