Standard German phonology

Updated on Apr 25, 2026

Edit

Comment

The phonology of Standard German is the standard pronunciation or accent of the German language. It deals with current phonology and phonetics as well as with historical developments thereof as well as the geographical variants and the influence of German dialects.

While the spelling of German is officially standardised by an international organisation (the Council for German Orthography) the pronunciation has no official standard and relies on a de facto standard documented in reference works such as Deutsches Aussprachewörterbuch (German Pronunciation Dictionary) by Eva-Maria Krech et al., Duden 6 Das Aussprachewörterbuch (Duden volume 6, The Pronunciation Dictionary) by Max Mangold and the training materials of radio and television stations such as Westdeutscher Rundfunk, Deutschlandfunk, or Schweizer Radio und Fernsehen. This standardised pronunciation was invented, rather than coming from any particular German-speaking city. Standard German is sometimes referred to as Bühnendeutsch (stage German), but the latter has its own definition and is slightly different.

Monophthongs

Some scholars treat /ə/ as an unstressed allophone of /ɛ/. Likewise, some scholars treat /ɐ/ as an allophone of the unstressed sequence /ər/. The phonemic status of /ɛː/ is also debated - see below.

Notes

Close vowels

/iː/ is close front unrounded [iː].

/yː/ is close near-front rounded [y̠ː]. Its rounding is compressed.

/uː/ is close back rounded [uː]. Its rounding is protruded.

/ɪ/ has been variously described as near-close front unrounded [ɪ̟] and near-close near-front unrounded [ɪ].

/ʏ/ is near-close near-front rounded [ʏ]. Its rounding is compressed.

/ʊ/ is near-close near-back rounded [ʊ]. Its rounding is protruded.

Mid vowels

/eː/ is close-mid front unrounded [eː].

In non-standard accents of the Low German speaking area, as well as in some Bavarian and Austrian accents it may be pronounced as a narrow closing diphthong [eɪ].

/øː/ has been variously described as close-mid near-front rounded [ø̠ː] and mid near-front rounded [ø̽ː]. Its rounding is compressed.

In non-standard accents of the Low German speaking area, as well as in some Austrian accents it may be pronounced as a narrow closing diphthong [øʏ].

/oː/ is close-mid back rounded [oː]. Its rounding is protruded.

In non-standard accents of the Low German speaking area, as well as in some Austrian accents it may be pronounced as a narrow closing diphthong [oʊ].

/ə/ has been variously described as mid central unrounded [ə]. and close-mid central unrounded [ɘ]. It occurs only in unstressed syllables, for instance in besetzen [bəˈzɛt͡sən] ('occupy'). It is often considered a complementary allophone together with [ɛ], which cannot occur in unstressed syllables. If a sonorant follows in the syllable coda, the schwa often disappears so that the sonorant becomes syllabic, for instance Kissen [ˈkɪsn̩] ('pillow'), Esel [ˈʔeːzl̩] ('donkey').

/ɛ/ has been variously described as mid near-front unrounded [ɛ̽] and open-mid front unrounded [ɛ].

/ɛː/ has been variously described as mid front unrounded [ɛ̝ː] and open-mid front unrounded [ɛː].

/œ/ has been variously described as open-mid near-front rounded [œ̠] and somewhat lowered open-mid near-front rounded [œ̠˕]. Its rounding is compressed.

/ɔ/ has been variously described as somewhat fronted open-mid back rounded [ɔ̟] and open-mid back rounded [ɔ]. Its rounding is protruded.

Open vowels

/ɐ/ is near-open central unrounded [ɐ]. It is a common allophone of the sequence /ər/ common to all German-speaking areas but Switzerland.

/a/ has been variously described as open front unrounded [a] and open central unrounded [ä]. Some scholars differentiate two short /a/, namely front /a/ and back /ɑ/. The latter occurs only in unstressed open syllables, exactly as /i, y, u, e, ø, o/.

Standard Austrian pronunciation of this vowel is back [ɑ].

Front [a] or even [æ] is a common realization of /a/ in northern German varieties influenced by Low German.

/aː/ has been variously described as open central unrounded [äː] and open back unrounded [ɑː]. Because of this, it is sometimes transcribed /ɑː/.

Back [ɑː] is the Standard Austrian pronunciation. It is also a common realization of /aː/ in northern German varieties influenced by Low German (in which it may even be rounded [ɒː]).

Wiese (1996) notes that "there is a tendency to neutralize the distinction between [a(ː)], [aɐ̯], and [ɐ]. That is, Oda, Radar, and Oder have final syllables which are perceptually very similar, and are nearly or completely identical in some dialects." He also says that "outside of a word context, [ɐ] cannot be distinguished from [a].

Although there is also a length contrast, vowels are often analyzed according to a tenseness contrast, with long /iː, yː, uː, eː, øː, oː/ being the tense vowels and short /ɪ, ʏ, ʊ, ɛ, œ, ɔ/ their lax counterparts. Like the English checked vowels, the German lax vowels require a following consonant, with the notable exception of [ɛː] (which is absent in many varieties, as discussed below). /a/ is sometimes considered the lax counterpart of tense /aː/ in order to maintain this tense/lax division. Short /i, y, u, e, ø, o/ occur in unstressed syllables of loanwords, for instance in Psychometrie /psyçomeˈtʁiː/ ('psychometry'). They are usually considered allophones of tense vowels, which cannot occur in unstressed syllables (unless in compounds).

Northern German varieties influenced by Low German could be analyzed as lacking contrasting vowel quantity entirely:

/aː/ has a different quality than /a/ (see above)

These varieties also consistently lack /ɛː/, and use only /eː/ in its place.

Phonemic status of /ɛː/

The long open-mid front unrounded vowel [ɛː] does not exist in many varieties of Standard German and is rendered as the close-mid front unrounded vowel [eː], so that both Ähre ('ear of grain') and Ehre ('honor') are pronounced [ˈʔeːʁə] (instead of "Ähre" being [ˈʔɛːʁə]) and both Bären ('bears') and Beeren ('berries') are pronounced [ˈbeːʁən] (instead of "Bären" being [ˈbɛːʁən]). It is debated whether [ɛː] is a distinct phoneme or even exists, except when consciously self-censoring speech, for several reasons:

The existence of a phoneme /ɛː/ is an irregularity in a vowel system that otherwise has pairs of long and tense vs. short and lax vowels such as [oː] vs. [ɔ];

The use of [ɛː] in Standard German is caused more by hypercorrection and the synthetically created pronunciation traditionally used on stage (Bühnendeutsch) than to a consistent dialectal difference. Although some dialects have an opposition of [eː] vs. [ɛː], there is little agreement across dialects as to whether individual lexical items should be pronounced with [eː] or with [ɛː];

The use of [ɛː] is a spelling pronunciation rather than an original feature of the language. It is an attempt to "speak as is printed" (sprechen wie gedruckt) and to differentiate the spellings ⟨e⟩ and ⟨ä⟩ (speakers of the language attempt to justify the appearance of ⟨e⟩ and ⟨ä⟩ in writing by making them distinct in the spoken language);

Speakers with an otherwise fairly standard idiolect find it rather difficult to utter longer passages with [eː] and [ɛː] in the right places. Such persons apparently have to picture the spellings of the words in question, which impedes the flow of speech.

Phonemic

/aɪ̯/ has been variously described as [äɪ], [äe̠] and [aɛ].

/aʊ̯/ has been variously described as [äʊ], [äʊ̞], [äo̟] and [aɔ].

/ɔʏ̯/ has been variously described as [ɔʏ], [ɔʏ̞], [ɔ̝e̠] and [ɔœ].

The process of smoothing is absent from standard German, so the sequences /aɪ̯ə, aʊ̯ə, ɔʏ̯ə/ are never pronounced *[aə̯, aə̯, ɔə̯] or *[aː, aː, ɔː].

Phonetic

Marginally, there are other diphthongs, for instance

[ʊɪ̯] in interjections such as pfui [p͡fʊɪ̯],

The following usually are not counted among the German diphthongs as German speakers often feel they are distinct marks of "foreign words" (Fremdwörter). These appear only in loanwords:

[o̯a], as in Croissant [kʁ̥o̯aˈsɑ̃], colloquially: [kʁ̥o̯aˈsaŋ].

Wiese (1996) states that many speakers of German will use the expression ok with [ɔʊ̯ˈkɛɪ̯] as a possible pronunciation quite frequently, and that alternatively, [ɔʊ̯] and [ɛɪ̯] can be monophthongized to [oː] and [eː], respectively. However, neither Mangold (2005) nor Krech et al. (2009) recognize these as phonemes. Instead, they prescribe pronunciations with, respectively, /oː/ and /eː/ in each loanword from English containing /oʊ/ and /eɪ/.

In the varieties where speakers vocalize /r/ to [ɐ] in the syllable coda, a diphthong ending in [ɐ̯] may be formed with every vowel except /ə/ and /ɐ/:

^1 Wiese (1996) notes that the length contrast is not very stable before non-prevocalic /r/ and that "Meinhold & Stock (1980:180), following the pronouncing dictionaries (Mangold (1990), Krech & Stötzer (1982)) judge the vowel in Art, Schwert, Fahrt to be long, while the vowel in Ort, Furcht, hart is supposed to be short. The factual basis of this presumed distinction seems very questionable." He goes on stating that in his own dialect, there is no length difference in these words, and that judgements on vowel length in front of non-prevocalic /r/ which is itself vocalized are problematic, in particular if /a/ precedes.According to the "lengthless" analysis, the aforementioned "long" diphthongs are analyzed as [iɐ̯], [yɐ̯], [uɐ̯], [ɛɐ̯], [eɐ̯], [øɐ̯], [oɐ̯] and [aɐ̯]. This makes non-prevocalic /ar/ and /aːr/ homophonous as [aɐ̯] or [aː]. Non-prevocalic /ɛr/ and /ɛːr/ may also merge, but the vowel chart in Kohler (1999) shows that they have somewhat different starting points - mid-centralized open-mid front [ɛ̽] for the former, open-mid front [ɛ] for the latter.Wiese (1996) also states that "laxing of the vowel is predicted to take place in shortened vowels; it does indeed seem to go hand in hand with the vowel shortening in many cases." This leads to [iɐ̯], [yɐ̯], [uɐ̯], [eɐ̯], [øɐ̯], [oɐ̯] being pronounced the same as [ɪɐ̯], [ʏɐ̯], [ʊɐ̯], [ɛɐ̯], [œɐ̯], [ɔɐ̯]. This merger is usual in the Standard Austrian accent, in which e.g. Mor 'bog' is often pronounced [mɔɐ̯]; this, in contrast with the Standard Northern variety, also happens intervocalically, along with the diphthongization of the laxed vowel to [Vɐ̯], so that e.g. Lehrer 'teacher' is pronounced [ˈlɛɐ̯ʁɐ] (the corresponding Standard Northern pronunciation is [ˈleːʁɐ]). Another feature of the Standard Austrian accent is complete absorption of [ɐ̯] by the preceding /ɑ, ɑː/, so that e.g. rar 'scarce' is pronounced [ʁɑː].

Consonants

With approximately 25 phonemes, the German consonant system has an average number of consonants in comparison with other languages. One of the more noteworthy ones is the unusual affricate /p͡f/.

Notes

/p͡f/ is bilabial–labiodental [p͡f], rather than purely labiodental [p̪͡f].

/t, d, l, n/ can be apical alveolar [t̺, d̺, l̺, n̺], laminal alveolar [t̻, d̻, l̻, n̻] or laminal denti-alveolar [t̪, d̪, l̪, n̪]. The other possible pronunciation of /d/ that has been reported to occur in unstressed intervocalic positions is retroflex [ɖ]. Austrian German often uses the laminal denti-alveolar articulation.

/l/ is always clear [l], as in most Irish English accents. A few Austrian accents may use a velarized [ɫ] instead, but that is considered non-standard.

In the Standard Austrian variety, /k/ may be affricated to [k͡x] before front vowels.

/t͡s, s, z/ can be laminal alveolar [t̻͡s̻, s̻, z̻], laminal post-dental [t̪͡s̪, s̪, z̪] (i.e. fronted alveolar, articulated with the blade of the tongue just behind upper front teeth), or even apical alveolar [t̺͡s̺, s̺, z̺]. Austrian German often uses the post-dental articulation. /s, z/ are always strongly fricated.

/t͡ʃ, d͡ʒ, ʃ, ʒ/ are strongly labialized palato-alveolar sibilants [t͡ʃʷ, d͡ʒʷ, ʃʷ, ʒʷ]. /ʃ, ʒ/ are fricated more weakly than /s, z/. There are two variants of these sounds:

Laminal, articulated with the foremost part of the blade of the tongue approaching the foremost part of the hard palate, with the tip of the tongue resting behind either upper or lower front teeth.

Apico-laminal, articulated with the tip of the tongue approaching the gums and the foremost part of the blade approaching the foremost part of the hard palate. According to Morciniec & Prędota (2005), this variant is used more frequently.

/θ, ð/ are used only in loanwords, mostly from English, such as Thriller /ˈθʁɪlɐ/, though some speakers substitute /θ/ with any of /t, s, f/ and /ð/ with any of /d, z, v/. There are two variants of these sounds:

Apical post-dental, articulated with the tip of the tongue approaching the upper incisors.

Apical interdental, articulated with the tip of the tongue between the upper and lower incisors.

/r/ has a number of possible realizations:

Voiced apical coronal trill/tap [r̺, ɾ̺], either alveolar (articulated with the tip of the tongue against the alveolar ridge), or dental (articulated with the tip of the tongue against the back of the upper front teeth).

Distribution: Common in the south (Bavaria and many parts of Switzerland and Austria), but it is also found in some speakers in central and northern Germany, especially the elderly. It is also one of possible realizations of /r/ in the Standard Austrian accent, but a more common alveolar realization is an approximant [ɹ]. Even more common are uvular realizations, fricatives [ʁ ~ χ] and a trill [ʀ].

Voiced uvular trill [ʀ], which can be realized as voiceless [ʀ̥] after voiceless consonants (as in treten). According to Lodge (2009) it is often a tap [ʀ̆] intervocalically (as in Ehre).

Distribution: Occurs in some conservative varieties - most speakers with a uvular /r/ realize it as a fricative or an approximant. It is also one of possible realizations of /r/ in the Standard Austrian accent, but it is less common than a fricative [ʁ ~ χ].

Dorsal continuant, about the quality of which there is not a complete agreement:

Krech et al. (2009) describe two fricative variants, namely post-palatal [ɣ˖] and velar [ɣ]. The post-palatal variant appears before and after front vowels, while the velar variant is used in all other positions.

Morciniec & Prędota (2005) describe it as voiced post-velar fricative [ʁ̟].

Mangold (2005) and Kohler (1999) describe it as voiced uvular fricative [ʁ];

Mangold (2005) states that "with educated professional radio and TV announcers, as with professional actors on the stage and in film, the [voiced uvular] fricative [realization of] /r/ clearly predominates."

In the Standard Austrian accent, the uvular fricative is also the most common realization, although its voicing is variable (that is, it can be either voiced [ʁ] or voiceless [χ]).

Kohler (1999) writes that "the place of articulation of the consonant varies from uvular in e.g. rot ('red') to velar in e.g. treten ('kick'), depending on back or front vowel contexts." He also notes that [ʁ] is devoiced after voiceless plosives and fricatives, especially those within the same word, giving the word treten as an example. According to this author, [ʁ] can be reduced to an approximant in an intervocalic position.

Ladefoged & Maddieson (1996) describe it as a uvular fricative [ʁ] or approximant [ʁ̞]. The latter is less likely to occur word-initially.

Distribution: Almost all areas apart from Bavaria and parts of Switzerland.

Near-open central unrounded vowel [ɐ] is a post-vocalic allophone of (mostly dorsal) varieties of /r/. The non-syllabic variant of it is not always near-open or central; it is similar to either [ɑ] or [ə], depending on the environment.

Distribution: Widespread, but less common in Switzerland.

The voiceless stops /p/, /t/, /k/ are aspirated except when preceded by a sibilant. Many southern dialects do not aspirate /p t k/, and some northern ones do so only in a stressed position. The voiceless affricates /p͡f/, /t͡s/, and /t͡ʃ/ are never aspirated, and neither are any other consonants besides the aforementioned /p, t, k/.

The obstruents /b, d, ɡ, z, ʒ, dʒ/ are voiceless lenis [b̥, d̥, ɡ̊, z̥, ʒ̊, d͜ʒ̊] in southern varieties, and they contrast with voiceless fortis [p, t, k, s, ʃ, t͡ʃ].

In Austria, intervocalic /b, d, ɡ/ can be lenited to fricatives [β, ð, ɣ].

Before and after front vowels (/ɪ, iː, ʏ, yː, ɛ, ɛː, eː, œ, øː/ and, in varieties that realize them as front, /a/ and/or /aː/), the velar consonants /ŋ, k, ɡ/ are realized as post-palatal [ŋ˖, k̟, ɡ˖]. According to Wiese (1996), in a parallel process, /k, ɡ/ before and after back vowels (/ʊ, uː, ɔ, oː/ and, in varieties that realize them as back, /a/ and/or /aː/) are retracted to post-velar [k̠, ɡ˗] or even uvular [q, ɢ].

There isn't a complete agreement about the nature of /j/; it has been variously described as a fricative [ʝ], a fricative, which can be fricated less strongly than /ç/, a sound variable between a weak fricative an approximant and an approximant [j], which is the usual realization in the Standard Austrian variety.

In standard usage and careful speech, [ʔ] occurs before word stems that begin with a vowel. Although not usually considered a phoneme, it may have phonemic value: will ich [vɪl ʔɪç] ('will I') vs. willig [ˈvɪlɪç] ('willing'). In colloquial and dialectal speech, however, /ʔ/ is very often omitted, especially when the word beginning with a vowel is unstressed.

The phonemic status of affricates is controversial. The majority view accepts /p͡f/ and /t͡s/, but not /t͡ʃ/ or the non-native /d͡ʒ/; some accept none, some accept all but /d͡ʒ/, and some accept all. [d͡ʒ] and [ʒ] occur only in words of foreign origin. In certain varieties, they are replaced by [t͡ʃ] and [ʃ] altogether.

[ʋ] is occasionally considered to be an allophone of /v/, especially in southern varieties of German.

[ç] and [x] are traditionally regarded as allophones after front vowels and back vowels, respectively. For a more detailed analysis see below at ich-Laut and ach-Laut. According to some analyses, [χ] is an allophone of /x/ after /a, aː/ and according to some also after /ʊ, ɔ, aʊ̯/. However, according to Moosmüller, Schmid & Brandstätter (2015), the uvular allophone is used after /ɔ/ only in the Standard Austrian variety.

Some phonologists deny the phoneme /ŋ/ and use /nɡ/ instead along with /nk/ instead of /ŋk/. The phoneme sequence /nɡ/ is realized as [ŋɡ] when /ɡ/ can start a valid onset of the next syllable whose nucleus is a vowel other than unstressed /ə/, /ɪ/, or /ʊ/. It becomes [ŋ] otherwise. For example:

Diphthong /ˈdɪftɔnɡ/ [ˈdɪftɔŋ]

diphthongieren /dɪftɔnˈɡiːʁən/ [ˌdɪftɔŋˈɡiːʁən]

Englisch /ˈɛnɡlɪʃ/ [ˈʔɛŋlɪʃ]

Anglo /ˈanɡloː/ [ˈʔaŋɡloː]

Ganges /ˈɡanɡəs/ [ˈɡaŋəs] ~ /ˈɡanɡɛs/ [ˈɡaŋɡɛs]

Ich-Laut and ach-Laut

Ich-Laut is the voiceless palatal fricative [ç] (which is found in the word ich [ʔɪç] 'I'), and ach-Laut is the voiceless velar fricative [x] (which is found in the word ach [ax] the interjection 'oh', 'alas'). Note that Laut [laʊ̯t] is the German word for 'sound, phone'. In German, these two sounds are allophones occurring in complementary distribution. The allophone [x] occurs after back vowels and /a aː/ (for instance in Buch [buːx] 'book'), the allophone [ç] after front vowels (for instance in mich [mɪç] 'me/myself') and consonants (for instance in Furcht [fʊʁçt] 'fear', manchmal [ˈmançmaːl] 'sometimes'). (This happens most regularly: if the ⟨r⟩ in Furcht is pronounced as a consonant, ch represents [ç]; however if, as often happens, it is vocalized as [ɐ], resembling the vowel [a], then ⟨ch⟩ may represent [x], yielding [fʊɐ̯xt].)

In loanwords, the pronunciation of potential fricatives in onsets of stressed syllables varies: in the Northern varieties of standard German, it is [ç], while in Southern varieties, it is [k], and in Western varieties, it is [ʃ] (for instance in China: [ˈçiːna] vs. [ˈkiːna] vs. [ˈʃiːna]).

The diminutive suffix -chen is always pronounced with an ich-Laut [-çən]. Usually, this ending triggers umlaut (compare for instance Hund [hʊnt] 'dog' to Hündchen [ˈhʏntçn̩] 'little dog'), so theoretically, it could only occur after front vowels. However, in some comparatively recent coinings, there is no longer an umlaut, for instance in the word Frauchen [ˈfʀaʊ̯çən] (a diminutive of Frau 'woman'), so that a back vowel is followed by a [ç], even though normally it would be followed by a [x], as in rauchen [ˈʀaʊ̯xən] ('to smoke'). This exception to the allophonic distribution may be an effect of the morphemic boundary or an example of phonemicization, where erstwhile allophones undergo a split into separate phonemes.

The allophonic distribution of [ç] after front vowels and [x] after other vowels is also found in other languages, such as Scots, in the pronunciation of light. However, it is by no means inevitable: Dutch, Yiddish, and many Southern German dialects retain [x] (which can be realized as [χ] instead) in all positions. It is thus reasonable to assume that Old High German ih, the ancestor of modern ich, was pronounced with [x] rather than [ç]. While it is impossible to know for certain whether Old English words such as niht (modern night) were pronounced with [x] or [ç], [ç] is likely (see Old English phonology).

Despite the phonetic history, the complementary distribution of [ç] and [x] in modern Standard German is better described as backing of /ç/ after a back vowel, rather than fronting of /x/ after a front vowel, because [ç] is used in onsets (Chemie [çeˈmiː] 'chemistry') and after consonants (Molch [mɔlç] 'newt'), and is thus the underlying form of the phoneme. This is an example of assimilation.

According to Kohler, the German ach-Laut is further differentiated into two allophones, [x] and [χ]: [x] occurs after /uː, oː/ (for instance in Buch [buːx] 'book') and [χ] after /a, aː/ (for instance in Bach [baχ] 'brook'), while either [x] or [χ] may occur after /ʊ, ɔ, aʊ̯/, with [χ] predominating.

Fortis–lenis pairs

Various German consonants occur in pairs at the same place of articulation and in the same manner of articulation, namely the pairs /p-b/, /t-d/, /k-ɡ/, /s-z/, /ʃ-ʒ/. These pairs are often called fortis–lenis pairs, since describing them as voiced–voiceless pairs is inadequate. With certain qualifications, /t͡ʃ-d͡ʒ/, /f-v/ are also considered fortis–lenis pairs.

Fortis-lenis distinction for /ʔ, m, n, ŋ, l, r, h/ is unimportant.

The fortis stops /p, t, k/ are aspirated in many varieties. The aspiration is strongest in the onset of a stressed syllable (such as Taler [ˈtʰaːlɐ] 'thaler'), weaker in the onset of an unstressed syllable (such as Vater [ˈfaːtʰɐ] 'father'), and weakest in the syllable coda (such as in Saat [zaːtʰ] 'seed'). All fortis consonants, i.e. /p, t, k, f, s, ʃ, ç, x, p͡f, t͡s, t͡ʃ, θ/ are fully voiceless.

The lenis consonants /b, d, ɡ, v, z, ʒ, j, r, d͡ʒ, ð/ range from being weakly voiced to almost voiceless [b̥, d̥, ɡ̊, v̥, z̥, ʒ̊, j̥, r̥, d͜ʒ̊, ð̥] after voiceless consonants: Kasbah [ˈkasb̥a] ('kasbah)', abdanken [ˈʔapd̥aŋkn̩] ('to resign'), rotgelb [ˈʁoːtɡ̊ɛlp] ('red-yellow'), Abwurf [ˈʔapv̥ʊʁf] ('dropping'), Absicht [ˈʔapz̥ɪçt] ('intention'), Holzjalousie [ˈhɔlt͜sʒ̊aluziː] ('wooden jalousie'), wegjagen [ˈvɛkj̥aːɡn̩] ('to chase away'), tropfen [ˈtʁ̥ɔp͡fn̩] ('to drop'), Obstjuice [ˈʔoːpstd͜ʒ̊uːs] ('fruit juice'). Mangold (2005) states that they are "to a large extent voiced" [b, d, g, v, z, ʒ, j, r, d͡ʒ, ð] in all other environments, but some studies have found the stops /b, d, ɡ/ to be voiceless word/utterance-initially in most dialects (while still contrasting with /p, t, k/ due to the aspiration of the latter).

/b, d, ɡ, z, ʒ/ are voiceless in most southern varieties of German. For clarity, they are often transcribed as [b̥, d̥, ɡ̊, z̥, ʒ̊].

The nature of the phonetic difference between the voiceless lenis consonants and the similarly voiceless fortis consonants is controversial. It is generally described as a difference in articulatory force, and occasionally as a difference in articulatory length; for the most part, it is assumed that one of these characteristics implies the other.

In various central and southern varieties, the opposition between fortis and lenis is neutralized in the syllable onset; sometimes just in the onset of stressed syllables, sometimes in all cases.

The pair /f-v/ is not considered a fortis–lenis pair, but a simple voiceless–voiced pair, as /v/ remains voiced in all varieties, including the Southern varieties that devoice the lenes (with however some exceptions). Generally, the southern /v/ is realized as the voiced approximant [ʋ]. However, there are southern varieties which differentiate between a fortis /f/ (such as in sträflich [ˈʃtrɛːflɪç] 'culpable' from Middle High German stræflich) and a lenis /f/ ([v̥], such as in höflich [ˈhøːv̥lɪç] 'polite' from Middle High German hovelîch); this is analogous to the opposition of fortis /s/ ([s]) and lenis [z̥].

Coda devoicing

In varieties from Northern Germany, lenis stops in the syllable coda are realized as fortis stops. This does not happen in varieties from Southern Germany, Austria or Switzerland.

Since the lenis stops /b, d, ɡ/ are unvoiced or at most variably voiced (as stated above), this cannot be called devoicing in the strict sense of the word because it does not involve the loss of phonetic voice. More accurately, it can be called coda fortition or a neutralization of fortis and lenis sounds in the coda. Fricatives are truly and contrastively voiced in Northern Germany. Therefore, the fricatives undergo coda devoicing in the strict sense of the word. It is disputed whether coda devoicing is due to a constraint which specifically operates on syllable codas or whether it arises from constraints which "protect voicing in privileged positions."

As against standard pronunciation rules, in western varieties including those of the Rhineland, coda fortis–lenis neutralization results in voicing rather than devoicing if the following word begins with a vowel. For example, mit uns becomes [mɪd‿ʊns] and darf ich becomes [daʁv‿ɪç]. The same sandhi phenomenon exists also as a general rule in the Luxembourgish language.

Stress

Stress in German usually falls on the first syllable, with the following exceptions:

Many loanwords, especially proper names, keep their original stress. E.g. Obama /oˈbaː.ma/

Nouns formed with Latinate suffixes, such as -ant, -anz, -enz, -ion, -ismus, -ist, -ment, -tät: Idealismus /ide.aˈlɪsmʊs/ ('idealism'), Konsonant /kɔnzoˈnant/ ('consonant'), Tourist /tuˈʁɪst/ ('tourist')

Verbs formed with the Latinate suffix -ieren, e.g. studieren /ʃtuˈdiːʁən/ ('to study'). This is often pronounced /iːɐ̯n/ in casual speech.

Compound adverbs, with her, hin, da, or wo as their first syllable part, receive stress on their second syllable, e.g. dagegen /daˈɡeːɡən/ ('on the other hand'), woher /voˈheːɐ̯/ ('from where')

Moreover, German makes a distinction in stress between separable prefixes (stress on prefix) and inseparable prefixes (stress on root) in verbs and words derived from such verbs. Therefore:

Words beginning with be-, ge-, er-, ver-, zer-, ent-, emp- and a few others receive stress on the second syllable.

Words having ab-, auf-, ein-, vor- as verb prefix, and most other prepositional adverbs receive stress on their first syllable.

Some prefixes, notably über-, unter-, um-, and durch-, can function as separable or inseparable prefixes, and are stressed and unstressed accordingly.

Rarely, two homographs with such prefixes are formed. They are not strictly homophones. Consider the word, umschreiben. As um•schreiben (separable prefix), it means 'to rewrite', and is pronounced [ˈʔʊmʃʀaɪ̯bən], and its associated noun, die Umschreibung also receives stress on the first syllable - [ˈʔʊmʃʀaɪ̯bʊŋ]. On the other hand, umschreiben (inseparable prefix) is pronounced [ʔʊmˈʃʀaɪ̯bən]. This word means 'to circumscribe', and its associated noun, die Umschreibung ('circumscription') also receives stress on the second syllable - [ʔʊmˈʃʀaɪ̯bʊŋ]. Another example is the word umfahren; with stress on the root ([ʔʊmˈfaːʀən]) it means 'to drive around (an obstacle in the street)', and with stress on the prefix ([ˈʔʊmfaːʀən]) it means 'to drive over' or 'to collide with (an object on the street).'

General

Like all infants, German infants go through a babbling stage in the early phases of phonological acquisition, during which they produce the sounds they will later use in their first words. Phoneme inventories begin with stops, nasals, and vowels; (contrasting) short vowels and liquids appear next, followed by fricatives and affricates, and finally all other consonants and consonant clusters. Children begin to produce protowords near the end of their first year. These words do not approximate adult forms, yet have a specific and consistent meaning. Early word productions are phonetically simple and usually follow the syllable structure CV or CVC, although this generalization has been challenged. The first vowels produced are /ə/, /a/, and /aː/, followed by /e/, /i/, and /ɛ/, with rounded vowels emerging last. German children often use phonological processes to simplify their early word production. For example, they may delete an unstressed syllable (Schokolade 'chocolate' pronounced [ˈlaːdə]), or replace a fricative with a corresponding stop (Dach [dax] 'roof' pronounced [dak]). One case study found that a 17-month-old child acquiring German replaced the voiceless velar fricative [x] with the nearest available continuant [h], or deleted it altogether (Buch [buːx] 'book' pronounced [buh] or [buː]).

Vowel space development

In 2009, Lintfert examined the development of vowel space of German speakers in their first three years of life. During the babbling stage, vowel distribution has no clear pattern. However, stressed and unstressed vowels already show different distributions in the vowel space. Once word production begins, stressed vowels expand in the vowel space, while the F1-F2 vowel space of unstressed vowels becomes more centralized. The majority of infants are then capable of stable production of F1. It should be noted that the variability of formant frequencies among individuals decreases with age. After 24 months, infants expand their vowel space individually at different rates. However, if the parents' utterances possess a well-defined vowel space, their children produce clearly distinguished vowel classes earlier. By about three years old, children command the production of all vowels, and they attempt to produce the four cardinal vowels, /y/, /i/, /u/ and /a/, at the extreme limits of the F1-F2 vowel space (i.e., the height and backness of the vowels are made extreme by the infants).

Grammatical words

Generally, closed-class grammatical words (e.g. articles and prepositions) are absent from children's speech when they first begin to combine words. However, children as young as 18 months old show knowledge of these closed-class words when they prefer stories with them, compared to passages with them omitted. Therefore, the absence of these grammatical words cannot be due to perceptual problems. Researchers tested children's comprehension of four grammatical words: bis [bɪs] ('up to'), von [fɔn] ('from'), das [das] ('the' neuter singular), and sein [zaɪ̯n] ('his'). After first being familiarized with the words, eight-month-old children looked longer in the direction of a speaker playing a text passage that contained these previously heard words. However, this ability is absent in six-month-olds.

Nasals

The acquisition of nasals in German differs from that of Dutch, a phonologically closely related language. German children produce proportionately more nasals in onset position (sounds before a vowel in a syllable) than Dutch children do. German children, once they reached 16 months old, also produced significantly more nasals in syllables containing schwas, when compared with Dutch-speaking children. This may reflect differences in the languages the children are being exposed to, although the researchers claim that the development of nasals likely cannot be seen apart from the more general phonological system the child is developing.

Phonotactic constraints and reading

A 2006 study examined the acquisition of German in phonologically delayed children (specifically, issues with fronting of velars and stopping of fricatives) and whether they applied phonotactic constraints to word-initial consonant clusters containing these modified consonants. In many cases, the subjects (mean age = 5;1) avoided making phonotactic violations, opting instead for other consonants or clusters in their speech. This suggests that phonotactic constraints do apply to the speech of German children with phonological delay, at least in the case of word-initial consonant clusters. Additional research has also shown that spelling consistencies seen in German raise children's phonemic awareness as they acquire reading skills.

Sound changes and mergers

A merger found mostly in Northern accents of German is that of /ɛː/ (spelled ⟨ä, äh⟩) with /eː/ (spelled ⟨e⟩, ⟨ee⟩, or ⟨eh⟩). Some speakers merge the two everywhere, some distinguish them everywhere, others keep /ɛː/ distinct only in conditional forms of strong verbs (for example ich gäbe [ˈɡɛːbə] 'I would give' vs. ich gebe [ˈɡeːbə] 'I give' are distinguished, but Bären [ˈbeːʁən] 'bears' vs. Beeren [ˈbeːʁən] 'berries' are not. Standard pronunciation of Bären is [ˈbɛːʁən]).

Another common merger is that of /ɡ/ at the end of a syllable with [ç] or [x], for instance Krieg [kʁ̥iːç] ('war'), but Kriege [ˈkʁ̥iːɡə] ('wars'); er lag [laːx] ('he lay'), but wir lagen [ˈlaːɡən] ('we lay'). This pronunciation is frequent all over central and northern Germany. It is characteristic of regional languages and dialects, particularly Low German in the North, where ⟨g⟩ represents a fricative, becoming voiceless in the syllable coda, as is common in German (final-obstruent devoicing). However common it is, this pronunciation is considered sub-standard. Only in one case, in the grammatical ending -ig (which corresponds to English -y), the fricative pronunciation of final ⟨g⟩ is prescribed by the Siebs standard, for instance wichtig [ˈvɪçtɪç] ('important'). The merger occurs neither in Austro-Bavarian and Alemannic German nor in the corresponding varieties of Standard German, and therefore in these regions -ig is pronounced [ɪɡ̊].

Many speakers do not distinguish the affricate /pf/ from the simple fricative /f/ in the beginning of a word. The verb (er) fährt ('[he] travels') and the noun Pferd ('horse') are then equally pronounced [fɛɐ̯t]. This occurs especially in regions where /p͡f/ did not originally occur in the local dialects, i.e. northern and western Germany. Some speakers also have peculiar pronunciation for /p͡f/ in the middle or end of a word, replacing the [f] in /p͡f/ with a voiceless bilabial fricative, i.e. a consonant produced by pressing air flow through the tensed lips. Thereby Tropfen ('drop') becomes [ˈtʁ̥ɔp͡ɸn̩], rather than [ˈtʁ̥ɔpf͡n̩].

Many speakers (especially in the North) who have a vocalization of [ʁ] after [a], merge this combination with long [aː] (i.e. [aʁ] > [aɐ] > [aː] or [äː]). Hereby, Schaf ('sheep') and scharf ('sharp') can both be pronounced [ʃaːf]. This merger does not occur where /aː/ is realised as a back vowel, thus keeping the words distinct as [ʃɑːf] and [ʃaːf]. However, in both Bavarian and Franconian dialects, the latter would always be pronounced [ʃarf] with a distinct [r] sound. Furthermore, in umlaut forms, the difference usually reoccurs: Schäfer [ˈʃɛːfɐ] vs. schärfer [ˈʃɛɐ̯fɐ]. Speakers with this merger also often use [aːç] (instead of formally normal [aːx]) where it stems from original [aʁç]. The word Archen ('arks') is thus pronounced [ˈʔaːçn̩], which makes a minimal pair with Aachen [ʔaːxn̩], making the difference between [ç] and [x] phonemic, rather than just allophonic, for these speakers.

In the standard pronunciation, the vowel qualities /i/, /ɪ/, /e/, /ɛ/, as well as /u/, /ʊ/, /o/, /ɔ/, are all still distinguished even in unstressed syllables. In this latter case, however, many simplify the system in various degrees. For some speakers, this may go so far as to merge all four into one, whence misspellings by schoolchildren such as Bräutegam (instead of Bräutigam) or Portogal (instead of Portugal).

In everyday speech, more mergers occur, some of which are universal and some of which are typical for certain regions or dialect backgrounds. Overall, there is a strong tendency of reduction and contraction. For example, long vowels may be shortened, consonant clusters may be simplified, word-final [ə] may be dropped in some cases, and the suffix -en may be contracted with preceding consonants, e.g. [ham] for haben [ˈhaːbən] ('to have').

When stops occur between two nasals (one being syllabic), they may be replaced by a glottal stop though they still determine the nature of the nasal. Thus, Lampen ('lamps') changes from [ˈlampən] to [ˈlamʔm̩]; speakers are often unaware of this.

If the clusters [mp], [lt], [nt], or [ŋk] are followed by another consonant, the stops /p/, /t/ and /k/ usually lose their phonemic status. Thus while the standard pronunciation distinguishes ganz [ɡant͡s] ('whole') from Gans [ɡans] ('goose'), as well as er sinkt [zɪŋkt] from er singt [zɪŋt], the two pairs are homophones for most speakers. The commonest practice is to drop the stop (thus [ɡans], [zɪŋt] for both words), but some speakers insert the stop where it is not etymological ([ɡants], [zɪŋkt] for both words), or they alternate between the two ways. Only few speakers retain a phonemic distinction.

Middle High German

The Middle High German vowels [ei̯] and [iː] developed into the modern Standard German diphthong [aɪ̯], whereas [ou̯] and [uː] developed into [aʊ̯]. For example, Middle High German heiz /hei̯s/ and wîz /wiːs/ ('hot' and 'white') became Standard German heiß /haɪ̯s/ and weiß /vaɪ̯s/. In some dialects, the Middle High German vowels have not changed, e.g. Swiss German heiss /hei̯s/ and wiiss /viːs/, while in other dialects or languages, the vowels have changed but the distinction is kept, e.g. Bavarian hoaß /hɔɐ̯s/ and weiß /vaɪ̯s/, Ripuarian heeß /heːs/ and wieß /viːs/, Yiddish הײס heys /hɛɪ̯s/ and װײַס vays /vaɪ̯s/.

The Middle High German diphthongs [iə̯], [uə̯] and [yə̯] became the modern Standard German long vowels [iː], [uː] and [yː] after the Middle High German long vowels changed to diphthongs. Most Upper German dialects retain the diphthongs. A remnant of their former diphthong character is shown when [iː] continues to be written ie in German (as in Liebe 'love').

Loanwords

German incorporates a significant number of loanwords from other languages. Loanwords are often adapted to German phonology but to varying degrees, depending on the speaker and the commonness of the word. /ʒ/ and /d͡ʒ/ do not occur in native German words but are common in a number of French and English loan words. Many speakers replace them with /ʃ/ and /t͡ʃ/ respectively (especially in Southern Germany, Austria and Switzerland), so that Dschungel (from English jungle) can be pronounced [ˈd͡ʒʊŋl̩] or [ˈt͡ʃʊŋl̩]. Some speakers in Northern and Western Germany merge /ʒ/ with /d͡ʒ/, so that Journalist (phonemically /d͡ʒʊʁnaˈlɪst ~ ʒʊʁnaˈlɪst/) can be pronounced [ʒʊɐ̯naˈlɪst], [d͡ʒʊɐ̯naˈlɪst] or [ʃʊɐ̯naˈlɪst]. The realization of /ʒ/ as [t͡ʃ], however, is uncommon.

Loanwords from English

Many English words are used in German, especially in technology and pop culture. Some speakers pronounce them similarly to their native pronunciation, but many speakers change non-native phonemes to similar German phonemes:

English /θ, ð/ are usually pronounced as in RP or General American; some speakers replace them with /s/ and /z/ respectively (th-alveolarization) e.g. Thriller [ˈθʁɪlɐ ~ ˈsʁɪlɐ].

English /ɹ/ can be pronounced the same as in English, i.e. [ɹ], or as the corresponding native German /r/ e.g. Rock [ʀɔk] or [rɔk]. German and Austrian speakers tend to be variably rhotic.

English /w/ is often replaced with German /v/ e.g. Whiskey [ˈvɪskiː].

word-initial /s/ is often retained (especially in the South, where word-initial /s/ is common), but many speakers replace it with /z/ e.g. Sound [zaʊ̯nt].

word-initial /st/ and /sp/ are usually retained, but some speakers (especially in South Western Germany and Western Austria) replace them with /ʃt/ and respectively /ʃp/ e.g. Steak [ʃteɪk] or [ʃteːk], Spray [ʃpʁeɪ] or [ʃpʁeː].

English /t͡ʃ/ is usually retained, but in Northern and Western Germany, as well as Luxembourg it is often replaced with /ʃ/ e.g. Chips [ʃɪps].

In Northern Standard German, final-obstruent devoicing is applied to English loan words just as to other words e.g. Airbag [ˈɛːɐ̯bɛk], Lord [lɔʁt] or [lɔɐ̯t], Backstage [ˈbɛksteːt͡ʃ]. However, in Southern Standard German, in Swiss Standard German and Austrian Standard German, final-obstruent devoicing does not occur and so speakers are more likely to retain the original pronunciation of word-final lenes (although realizing them as fortes may occur because of confusing English spelling with pronunciation).

English /eɪ/ and /oʊ/ are often replaced with /eː/ and /oː/ respectively e.g. Homepage [ˈhoːmpeːt͡ʃ].

English /æ/ and /ɛ/ are pronounced the same, as German /ɛ/ (met–mat merger) e.g. Backup [ˈbɛkap].

English /ɒ/ and /ɔː/ are pronounced the same, as German /ɔ/ (cot–caught merger) e.g. Box [bɔks].

English /ʌ/ is usually pronounced as German /a/ e.g. Cutter [ˈkatɐ].

English /ɜːr/ is usually pronounced as German /œʁ/ e.g. Shirt [ʃœʁt] or [ʃœɐ̯t].

English /i/ is pronounced as German /iː/ (happy-tensing) e.g. Whiskey [ˈvɪskiː].

Sample

The sample text is a reading of the first sentence of The North Wind and the Sun. The phonemic transcription treats every instance of [ɐ] and [ɐ̯] as /ər/ and /r/, respectively. The phonetic transcription is a fairly narrow transcription of the educated northern accent. The speaker transcribed in the narrow transcription is 62 years old, and he is reading in a colloquial style. Aspiration, glottal stops and devoicing of the lenes after fortes are not transcribed.

Note that the audio file contains the whole fable, and that it was recorded by a much younger speaker.

Phonemic transcription

/aɪ̯nst ˈʃtrɪtən zɪç ˈnɔrtvɪnt ʊnt ˈzɔnə | veːr fɔn iːnən ˈbaɪ̯dən voːl deːr ˈʃtɛrkərə vɛːrə | als aɪ̯n ˈvandərər | deːr ɪn aɪ̯nən ˈvarmən ˈmantəl ɡəˌhʏlt var | dɛs ˈveːɡəs daˈheːrkaːm/

Phonetic transcription

[aɪ̯ns ˈʃtʁɪtn̩ zɪç ˈnɔɐ̯tvɪnt ʊn ˈzɔnə | veːɐ̯ fən iːm ˈbaɪ̯dn̩ voːl dɐ ˈʃtɛɐ̯kəʁə veːʁə | als aɪ̯n ˈvandəʁɐ | dɛɐ̯ ɪn aɪ̯n ˈvaɐ̯m ˈmantl̩ ɡəˌhʏlt vaɐ̯ | dəs ˈveːɡəs daˈheːɐ̯kaːm]

Orthographic version

Einst stritten sich Nordwind und Sonne, wer von ihnen beiden wohl der Stärkere wäre, als ein Wanderer, der in einen warmen Mantel gehüllt war, des Weges daherkam.

References

Standard German phonology Wikipedia

(Text) CC BY-SA

Contents