Talk:Dotted and dotless I in computing
This is the talk page for discussing improvements to the Dotted and dotless I in computing article. This is not a forum for general discussion of the article's subject. |
Article policies
|
Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL |
![]() | Writing systems Unassessed | |||||||||
|
![]() | Turkey B‑class Low‑importance | |||||||||
|
Comments
Pronunciation
Can someone please explain in layman's terms how the dotless I is pronounced? No IPA stuff, and no scientific terms about which parts of the mouth should move where, thanks. — JIP | Talk 10:02, 27 July 2005 (UTC)
- There's a sound sample at close back unrounded vowel. Muntfish 10:11, 27 July 2005 (UTC)
Casing
Why cause Turkish dotted and dotless letter I problems on Turkish computers? --84.61.43.168 14:02, 5 October 2005 (UTC)
Most codepages did make Turkish difficult. What should have been done was to encode six letters:
- LATIN CAPITAL LETTER I
- LATIN SMALL LETTER I
- LATIN CAPITAL LETTER DOTLESS I (Turkish, Azerbaijani)
- LATIN SMALL LETTER DOTLESS I (Turkish, Azerbaijani)
- LATIN CAPITAL LETTER I WITH DOT (≡I+combining dot above)
- LATIN SMALL LETTER I WITH DOT (≡i+combining dot above)
This would have made representation of multilingual texts much easier, and would have done away with any language-specific case mapping. —Typhlosion 18:14, 24 October 2005 (UTC)
- but encoding homographs in a traditional code page would have meant losing other letters or symbols and would have introduced all the issues associated with homographs (text that looks identical but isn't,having to make sure you enter the correct one of a set of homographs etc). Plugwash
Open questions
Why has neither Unicode nor ISO-8859-9 separate code points for English and Turkish small dotted I? --84.61.35.152 09:12, 28 March 2006 (UTC)
- Presumablly whoever standardised them considered them to be the same letters. Plugwash 18:12, 28 March 2006 (UTC)
- Precisely. Similarly, there is no special code point for Turkish lowercase dotted I. Its just a lowercase I. -- Jmabel | Talk 03:14, 3 April 2006 (UTC)
How can a small capital dotted i be represented in OpenType? --88.76.248.120 10:29, 9 March 2007 (UTC)
- Font size should be independent of what character is being represented. Or am I missing something? - Jmabel | Talk 06:49, 29 March 2007 (UTC)
- And it looks like it works fine:
İ | İ
| |
İ | İ
|
JDK 6 fixes for dotless-I
Specific issues in the JDK relating to improper treatment of 'I' and 'i' in Turkish locale have been fixed in JDK 6. But it is misleading to say that this was a single bug which is now fixed. These usages were simply usages of the basic API, similar to those in any Java software. The basic problem is that the methods String.toLowerCase() and String.toUpperCase() without arguments exhibit behavior depending on the default locale, which in practice means that they work consistently for everyone except Turkish users. Developers must be explicitly aware of this danger and should use the variants of these methods which take a Locale argument - Locale.getDefault() if that is what is really intended, or (for example) Locale.US for simple ASCII conversions.