Digraph (orthography)
A digraph or bigraph is a pair of letters used to write one sound or a combination of sounds that does not correspond to the written letters combined. This is often, but not necessarily, a sound (or more precisely a phoneme) which cannot be expressed using a single letter in the alphabet used for writing.
Sometimes, when digraphs do not represent a new phoneme, they are a relic from an earlier period in the language's history when they did (or remain phonemic only in certain dialects, e.g. wh in English).
Some schemes of transliteration into the Roman alphabet make extensive use of digraphs (e.g. Cyrillic to Roman for English readers), while others rely solely on diacritics (e.g. Cyrillic to the modified Roman used for Turkish). To avoid ambiguity, transliteration based on diacritics is generally preferred in academic circles. Many writing systems, like Cyrillic and Devanagari, have no digraphs, and so transliterations into languages using them also cannot use digraphs.
There are three kinds of digraphs: sequences, reversals (really a special kind of sequence) and doubled letters.
Sequences
This is a group of two different letters in a specific order.
Examples from languages include:
- Czech
- ch represents /x/ (voiceless velar fricative)
- dž represents /ʤ/ (voiced postalveolar affricate), it occurs almost in words of foreign origin (e. g. džem /ʤɛm/, jam).
- Dutch
- ch represents /x/ (voiceless velar fricative)
- eu represents /ø/ (close-mid front rounded vowel)
- ie represents /i/ (close front unrounded vowel)
- ng represents /ŋ/ (velar nasal)
- oe represents /u/ (close back rounded vowel)
- sj represents /ʃ/ (voiceless postalveolar fricative)
- English
- ch usually corresponds to /tʃ/ (voiceless postalveolar affricate), less often to /ʃ/ (voiceless postalveolar fricative)
- wh represents /ʍ/ (voiceless labial-velar fricative; see also hwair)
- th usually corresponds to /θ/ (voiceless interdental fricative) or /ð/ (voiced interdental fricative)
- sh represents /ʃ/, (voiceless postalveolar fricative)
- ng represents /ŋ/ (velar nasal)
- kn represents /n/ (alveolar nasal)
- ph represents /f/ (voiceless labiodental fricative)
- gh represents /f/ (voiceless labiodental fricative) or is silent
- ck represents /k/ (voiceless velar plosive)
- ea, ie, ei correspond mostly to /i/ (close front unrounded vowel)
- ai, ay correspond mostly to /ei/ (diphthong: close-mid front unrounded vowel followed by close front unrounded vowel)
- ue corresponds to /u/ (close back unrounded vowel)
- French
- ai represents /ɛ/ (open-mid front unrounded vowel)
- au represents /o/ (close-mid back rounded vowel)
- ch represents /ʃ/ (voiceless postalveolar fricative)
- ou represents /u/ (close back rounded vowel) or /w/ (labio-velar approximant)
- gn represents /ɲ/ (palatal nasal)
- qu represents /k/ (voiceless velar stop), typically before historic front vowels
- See also French phonology
- German
- ae, as ä represents /ɛ/ (open-mid front unrounded vowel)
- ch represents /x/ (voiceless velar fricative) or /ç/ (voiceless palatal fricative)
- ck represents /k/ (voiceless velar plosive)
- ei represents /a͡ɪ/ (open front unrounded vowel) followed by (near-close near-front unrounded vowel)
- eu represents /ɔ͡ʏ/ (open-mid back rounded vowel) followed by (near-close near-front rounded vowel)
- oe, as ö represents /œ/ (open-mid front rounded vowel) or /ø/ (close-mid front rounded vowel)
- ss, as ß represents /s/ (voiceless alveolar fricative)
- ue, as ü represents /y/ (close front rounded vowel) or /ʏ/ (near-close near-front rounded vowel)
- Hungarian
- cs represents /tʃ/ (voiceless postalveolar affricate)
- gy represents /ɟ/ (voiced palatal plosive)
- ny represents /ɲ/ (palatal nasal)
- sz represents /s/ (voiceless alveolar fricative) (s is pronounced as /ʃ/)
- ty represents /c/ (voiceless palatal plosive)
- zs represents /ʒ/ (voiced postalveolar fricative)
- Italian
- gl represents /ʎ/, palatal lateral approximant, before -i (with some exceptions)
- gn represents /ɲ/ (palatal nasal)
- sc corresponds to /ʃ/, (voiceless postalveolar fricative) before -i and -e (but to /sk/ before other letters)
- ch corresponds to /k/ (only used before i, e)
- gh corresponds to /g/ (only used before i, e)
- Modern Greek
- αι (ai) represents /ɛ/
- ει (ei) represents /i/
- οι (oi) represents /i/
- ου (ou) represents /u/
- γκ (gk) represents /k/
- μπ (mp) represents /b/
- ντ (nt) represents /d/
Some of the above depend on context — see Greek alphabet.
- Polish
- ch corresponds to /x/ (voiceless velar fricative)
- cz corresponds to /tʂ/ (voiceless retroflex affricate)
- dz corresponds to /dz/ (voiced alveolar affricate)
- dź corresponds to /dʑ/ (voiced alveolo-palatal affricate)
- dż corresponds to /dʐ/ (voiced retroflex affricate)
- rz corresponds to /ʐ/ (voiced retroflex fricative)
- sz corresponds to /ʂ/ (voiceless retroflex fricative)
- Portuguese
- ch corresponds to /ʃ/ (voiceless postalveolar fricative)
- gu corresponds to /g/ (voiced velar plosive) before e or i
- lh corresponds to /ʎ/ (palatal lateral approximant)
- nh corresponds to /ɲ/ (palatal nasal)
- ou corresponds to /o/ (close-mid back rounded vowel)
- qu corresponds to /k/ (voiceless velar plosive) before e or i
- Serbian, Croatian
- lj corresponds to /ʎ/, palatal lateral approximant
- nj corresponds to /ɲ/ (palatal nasal)
- dž corresponds to /dʒ/ (voiced postalveolar affricate)
- Spanish
- ch represents /tʃ/ (voiceless postalveolar affricate)
- gu corresponds to /g/ (voiced velar plosive) before e or i
- qu corresponds to /k/ (voiceless velar plosive) before e or i
- Thai (transcription system)
- kh represents /kʰ/
- ph represents /pʰ/
- th represents /tʰ/
- ng represents /ŋ/
- Welsh
- ch represents /χ/ (voiceless uvular fricative), similar to French "r"
- ng represents /ŋ/ (velar nasal), the same sound as in English
- ph represents /f/ (voiceless labiodental fricative)
- rh represents /r̥/ a voiceless R, pronounced roughly like the English combination HR
- th represents /θ/ (voiceless interdental fricative)
Reversals
Reversals are sequences in which both possible orders of letters are common enough to be digraphs.
- English
- re corresponds to /ər/
- le corresponds to /əl/
Doubled letters
These have both letters the same. In some languages these indicate length, a stressed syllable or a new sound, and in some cases they are just part of the spelling convention. Ll is the most common in English, though it does not represent a different sound, but that is not the case in other languages; Welsh's ll is a voiceless lateral, and in Spanish it is a palatal lateral approximant /ʎ/ or else a palatal fricative. Ee and oo are common examples from English. Rr in Spanish and Italian indicates a trill, and forms a minimal pair with the single r. Italian's zz represents the affricate /ʦ/.
In Czech (and analogically in other Slavic languages) doubled letters occur in word-formation by prefixes and suffixes, and in composite words. Therefore, doubled letters are not considered as digraphs in Czech. Examples:
- bezzubý (toothless)
- cenný (valuable)
- černooký (black-eyed)
- English
- ll corresponds to /l/ (voiced alveolar lateral approximant)
- ee corresponds to /i/ (close front unrounded vowel)
- oo corresponds to /u/ (close back unrounded vowel)
- Dutch
- aa corresponds to /a/ (open front unrounded vowel)
- ee corresponds to /e/ (close-mid front unrounded vowel)
- oo corresponds to /o/ (close-mid back rounded vowel)
- uu corresponds to /y/ (close front rounded vowel)
- CC (where C means any consonant) corresponds to C and signifies that the preceding vowel is short.
- Portuguese
- rr corresponds to intervocalic guttural R
- ss corresponds to intervocalic /s/ (voiceless alveolar fricative)
- Welsh
- dd, a voiced dental fricative, like English then
- ff, the voiceless labiodental fricative, /f/ (like English f, as Welsh F is pronounced like English V)
- ll, a voiceless alveolar lateral fricative, /ɬ/