跳转到内容

Unicode

维基百科,自由的百科全书

这是本页的一个历史版本,由Tomchiukc留言 | 贡献2004年4月2日 (五) 02:16编辑。这可能和当前版本存在着巨大的差异。

(差异) ←上一修订 | 最后版本 (差异) | 下一修订→ (差异)


HTML 4.0 採用 Unicode 作為標準字集,利用特定的數字代碼,我們可以在各種支援HTML 4.0 標準的瀏覽器上顯示任何地區文字的網頁,只要電腦本身安裝有合適的字型檔就行。

過去電腦編碼的8位標準,使每個國家都只可以按國家使用的字符而編定各自的編碼系統;而對於部份字符系統比較複雜的語言,如越南語,又或者東亞國家的大型字符集,都不能在8位的環境下好好顯示。連自己的語言也未必可以好好顯示的話,就更惶論顯示其他國家的文字了。然而,現在我們可以利用 HTML 4.0 的新命令,利用&#nnn;的格式顯示特定的字符。nnn代表該字符的十進位Unicode代碼。如果想採用十六進位代碼的話,要在編碼之前加上x字符。

以下的請幫忙翻譯

The support for hexadecimal in this context is more recent, so older browsers might have problems displaying those characters – but they will probably have a problem displaying Unicode characters outside the 8-bit range in the first place. It is still a common practice to convert the hexadecimal code point into a decimal value (e.g. ♠ instead of ♠).

In the Unicode standard, each code point is expressed in the notation U+hhhh, where hhhh are the hexadecimal digits.

There is also a standard set of named character entity references for commonly used symbols outside of some character encodings, so one can use —, for example, to represent an em dash—like this—in text even if the character encoding used doesn't contain that character.

Many browsers, though, are only capable of displaying a small subset of the full UCS-2 repertoire. Here is how your browser displays various Unicode code points:

CodeDescriptionWhat your browser displays
ALatin capital letter "A"A
ßLatin small letter "Sharp S"ß
þLatin small letter "Thorn"þ
ΔGreek capital letter "Delta"Δ
ЙCyrillic capital letter "Short I"Й
קHebrew letter "Qof"ק
مArabic letter "Meem"م
๗Thai digit 7
&amp#4688;Ethiopic syllable "Qha"
あJapanese Hiragana "A"
叶Simplified Chinese "Leaf"
葉Traditional Chinese "Leaf"
냻Korean Hangul syllable "Nieun Yae Rieulhieuh"

Some multilingual web browsers that dynamically merge the required font sets on demand, e.g., Microsoft's Internet Explorer 5.5 on Windows, or Mozilla/Netscape 6 cross-platform, are capable of displaying all the Unicode characters on this page simultaneously after the appropriate "text display support packs" are downloaded. MSIE 5.5 would prompt the users if a new font were needed via its "install on demand" feature. Other browsers such as Netscape Navigator 4.77 can only display text supported by the current font associated with the character encoding of the page. When you are using the latter type of browser, it is unlikely that your computer has all of those fonts, nor the browser can use all available fonts on the same page. As a result, the browser will not display the text above all correctly, though it may display a subset of them. Because they are encoded according to the standard, though, they will display correctly on any system that is compliant and does have the characters available. Further, those characters given names for use in named entity references are likely to be more commonly available than others.

外部連結