Uralic languages

Updated on Oct 06, 2024

Edit

Comment

Proto-language Proto-Uralic Glottolog ural1272		ISO 639-5 urj

Geographicdistribution Central, Eastern, and Northern Europe, North Asia Linguistic classification One of the world's primary language families Subdivisions FinnicHungarianKhantyMansiMariMordvinicPermicSamiSamoyedic

The Uralic languages /jʊˈrælᵻk/ (sometimes called Uralian languages /jʊˈreɪliən/) constitute a language family of 38 languages spoken by approximately 25 million people, predominantly in Northern Eurasia. The Uralic languages with the most native speakers are Hungarian, Finnish, and Estonian, which are official languages of Hungary, Finland, and Estonia, respectively, and of the European Union. Other Uralic languages with significant numbers of speakers are Erzya, Moksha, Mari, Udmurt, and Komi, which are officially recognized languages in various regions of Russia.

The name "Uralic" derives from the fact that areas where the languages are spoken spread on both sides of the Ural Mountains. Also, the original homeland (Urheimat) is commonly hypothesized to lie in the vicinity of the Urals.

Finno-Ugric is sometimes used as a synonym for Uralic, though Finno-Ugric is widely understood to exclude the Samoyedic languages. Scholars who do not accept the traditional notion that Samoyedic split first from the rest of the Uralic family, such as Tapani Salminen, may treat both terms as synonymous.

Homeland

In recent times, linguists often place the Urheimat (original homeland) of the Proto-Uralic language in the vicinity of the Volga River, west of the Urals, close to the Urheimat of the Indo-European languages, or to the east and southeast of the Urals. Gyula László places its origin in the forest zone between the Oka River and central Poland. E. N. Setälä and M. Zsirai place it between the Volga and Kama Rivers. According to E. Itkonen, the ancestral area extended to the Baltic Sea. P. Hajdu has suggested a homeland in western and northwestern Siberia.

Early attestations

The first plausible mention of a Uralic people is in Tacitus's Germania (c. 98 AD), mentioning the Fenni (usually interpreted as referring to the Sami) and two other possibly Uralic tribes living in the farthest reaches of Scandinavia. There are many possible earlier mentions, including the Iyrcae (perhaps related to Yugra) described by Herodotus living in what is now European Russia, and the Budini, described by Herodotus as notably red-haired (a characteristic feature of the Udmurts) and living in northeast Ukraine and/or adjacent parts of Russia. In the late 15th century, European scholars noted the resemblance of the names Hungaria and Yugria, the names of settlements east of the Ural. They assumed a connection but did not seek linguistic evidence.

Uralic studies

The affinity of Hungarian and Finnish was first proposed in the late 17th century. Two of the more important contributors were the German scholar Martin Vogel, who established several grammatical and lexical parallels between Finnish and Hungarian, and the Swedish scholar Georg Stiernhielm, who commented on the similarities of Sami, Estonian and Finnish, and also on a few similar words between Finnish and Hungarian. These two authors were thus the first to outline what was to become the classification of the Finno-Ugric, and later Uralic family. This proposal received some of its initial impetus from the fact that these languages, unlike most of the other languages spoken in Europe, are not part of what is now known as the Indo-European family.

In 1717, Swedish professor Olof Rudbeck proposed about 100 etymologies connecting Finnish and Hungarian, of which about 40 are still considered valid. In the same year, the German scholar Johann Georg von Eckhart, in an essay published in Leibniz's Collectanea Etymologica, proposed for the first time a relation to the Samoyedic languages.

Philip Johan von Strahlenberg in 1730 published his book Das Nord- und Ostliche Theil von Europa und Asia, surveying the geography, peoples and languages of Russia. All the main groups of the Uralic languages were already identified here. Nonetheless, these relationships were not widely accepted. Hungarian intellectuals especially were not interested in the theory and preferred to assume connections with Turkic tribes, an attitude characterized by Ruhlen (1987) as due to "the wild unfettered Romanticism of the epoch". Still, in spite of this hostile climate, the Hungarian Jesuit János Sajnovics travelled with Maximilian Hell to survey the alleged relationship between Hungarian and Sami. Sajnovics published his results in 1770, arguing for a relationship based on several grammatical features. In 1799, the Hungarian Sámuel Gyarmathi published the most complete work on Finno-Ugric to that date.

At the beginning of the 19th century, research on this family was thus more advanced than Indo-European research. But the rise of Indo-European comparative linguistics absorbed so much attention and enthusiasm that Uralic linguistics was all but eclipsed in Europe; in Hungary, the only European country that would have had a vested interest in the family (Finland and Estonia being under Russian rule), the political climate was too hostile for the development of Uralic comparative linguistics.

Progress resumed after a number of decades with the first major field research expeditions on the Uralic languages spoken in the more eastern parts of Russia. These were made in the 1840s by Matthias Castrén (1813–1852) and Antal Reguly (1819–1858), who focused especially on the Samoyedic and the Ob-Ugric languages, respectively. Reguly's materials were worked on by the Hungarian linguist Pál Hunfalvy (1810–1891) and German Josef Budenz (1836–1892), who both supported the Uralic affinity of Hungarian. Budenz was the first scholar to bring this result to popular consciousness in Hungary, and to attempt a reconstruction of the Proto-Finno-Ugric grammar and lexicon. Another late-19th-century Hungarian contribution is that of Ignácz Halász (1855–1901), who published extensive comparative material of Finno-Ugric and Samoyedic in the 1890s, and whose work is at the base of today's wide acceptance of the inclusion of Samoyedic as a part of Uralic. Meanwhile, in the autonomous Grand Duchy of Finland, a chair for Finnish language and linguistics at the University of Helsinki was created in 1850, first held by Castrén.

In 1883 the Finno-Ugrian Society was founded in Helsinki on the proposal of Otto Donner, and during the late 19th and early 20th century (until the separation of Finland from Russia following the Russian revolution), a large number of stipendiates were sent by the Society to survey the less known Uralic languages. Major researchers of this period included Heikki Paasonen (studying especially the Mordvinic languages), Yrjö Wichmann (fi) (Permic), Artturi Kannisto (fi) (Mansi), Kustaa Fredrik Karjalainen (Khanty), Toivo Lehtisalo (Nenets), and Kai Donner (Kamass). The vast amounts of data collected on these expeditions would provide edition work for later generations of Finnish Uralicists for more than a century.

Classification

The Uralic family comprises nine undisputed groups with no consensus classification between them. (Some of the proposals are listed in the next section.) An agnostic approach treats them as separate branches.

Obsolete or native names are displayed in italics.

Finnic (Fennic, Baltic Finnic, Balto-Finnic, Balto-Fennic)

Hungarian (Magyar)

Khanty (Ostyak, Handi, Hantõ)

Mansi (Vogul)

Mari (Cheremis)

Mordvinic (Mordvin, Mordvinian)

Permic (Permian)

Sami (Saami, Samic, Saamic, Lappic, Lappish)

Samoyedic (Samoyed)

There is also historical evidence of a number of extinct languages of uncertain affiliation:

Merya

Muromian

Meshcherian (until 16th century?)

Traces of Finno-Ugric substrata, especially in toponymy, in the northern part of European Russia have been proposed as evidence for even more extinct Uralic languages.

Traditional classification

All Uralic languages are thought to have descended, through independent processes of language change, from Proto-Uralic. The internal structure of the Uralic family has been debated since the family was first proposed. Doubts about the validity of most of the proposed higher-order branchings (grouping the nine undisputed families) are becoming more common.

A traditional classification of the Uralic languages has existed since the late 19th century, tracing back to Donner (1879). It has enjoyed frequent adaptation in whole or in part in encyclopedias, handbooks, and overviews of the Uralic family. Donner's model is as follows:

Ugric (Ugrian)

Hungarian

Ob-Ugric (Ob Ugrian)

Khanty

Mansi

Finno-Permic (Permian-Finnic)

Permic

Finno-Volgaic (Finno-Cheremisic, Finno-Mari)

Volga-Finnic

Mari

Mordvinic

Finno-Lappic (Finno-Saamic, Finno-Samic)

Sami

Finnic

At Donner's time, the Samoyedic languages were still poorly known, and he was not able to address their position. As they became better known in the early 20th century, they were found to be quite divergent, and they were assumed to have separated already early on. The terminology adopted for this was "Uralic" for the entire family, "Finno-Ugric" for the non-Samoyedic languages (though "Finno-Ugric" has, to this day, remained in use also as a synonym for the whole family). Finno-Ugric and Samoyedic are listed in ISO 639-5 as primary branches of Uralic.

Nodes of the traditional family tree recognized in some overview sources:

Hajdú describes the Ugric and Volgaic groups as areal units.
Austerlitz however accepts narrower-than-traditional Finno-Ugric and Finno-Permic groups that exclude Samic.

Little explicit evidence has however been presented in favor of Donner's model since his original proposal, and numerous alternate schemes have been proposed. Especially in Finland there has been a growing tendency to reject the Finno-Ugric intermediate protolanguage. A recent competing proposal instead unites Ugric and Samoyedic in an "East Uralic" group for which shared innovations can be noted.

The Finno-Permic grouping still holds some support, though the arrangement of its subgroups is a matter of some dispute. Mordvinic is commonly seen as particularly closely related to or part of Finno-Samic. The term Volgaic (or Volga-Finnic) was used to denote a branch previously believed to include Mari, Mordvinic and a number of the extinct languages, but it is now obsolete and considered a geographic classification rather than a linguistic one.

Within Ugric, uniting Mansi with Hungarian rather than Khanty has been a competing hypothesis to Ob-Ugric.

Lexical isoglosses

Lexicostatistics has been used in defense of the traditional family tree. A recent re-evaluation of the evidence however fails to find support for Finno-Ugric and Ugric, suggesting four lexically distinct branches (Finno-Permic, Hungarian, Ob-Ugric and Samoyedic).

One alternate proposal for a family tree, with emphasis on the development of numerals, is as follows:

Uralic (*kektä "2", *wixti "5" / "10")

Samoyedic (*op "1", *ketä "2", *näkur "3", *tettə "4", *səmpəleŋkə "5", *məktut "6", *sejtwə "7", *wiət "10")

Finno-Ugric (*üki/*ükti "1", *kormi "3", *ńeljä "4", *wiiti "5", *kuuti "6", *luki "10")

Mansic

Mansi

Hungarian (hét "7"; replacement egy "1")

Finno-Khantic (reshaping *kolmi "3" on the analogy of "4")

Khanty

Finno-Permic (reshaping *kektä > *kakta)

Permic

Finno-Volgaic (*śećem "7")

Mari

Finno-Saamic (*kakteksa, *ükteksa "8, 9")

Saamic

Finno-Mordvinic (replacement *kümmen "10" (*luki- "to count", "to read out"))

Mordvinic

Finnic

Phonological isoglosses

Another proposed tree, more divergent from the standard, focusing on consonant isoglosses (which does not consider the position of the Samoyedic languages) is presented by Viitso (1997), and refined in Viitso (2000):

Finno-Ugric

Saamic–Fennic (consonant gradation)

Saamic

Fennic

Eastern Finno-Ugric

Mordva

(node)

Mari

Permian–Ugric (*δ > *l)

Permian

Ugric (*s *š *ś > *ɬ *ɬ *s)

Hungarian

Khanty

Mansi

The grouping of the four bottom-level branches remains to some degree open to interpretation, with competing models of Finno-Saamic vs. Eastern Finno-Ugric (Mari, Mordvinic, Permic-Ugric; *k > ɣ between vowels, degemination of stops) and Finno-Volgaic (Finno-Saamic, Mari, Mordvinic; *δ́ > δ between vowels) vs. Permic-Ugric. Viitso finds no evidence for a Finno-Permic grouping.

Extending this approach to cover the Samoyedic languages suggests affinity with Ugric, resulting in the aforementioned East Uralic grouping, as it also shares the same sibilant developments. A further non-trivial Ugric-Samoyedic isogloss is the reduction *k, *x, *w > ɣ when before *i, and after a vowel (cf. *k > ɣ above), or adjacent to *t, *s, *š, or *ś.

Finno-Ugric consonant developments after Viitso (2000); Samoyedic changes after Sammallahti (1988)

Note: Proto-Khanty *ɬ in many of the dialects yields *t; it is assumed this also happened in Mansi and Samoyedic.

The inverse relationship between consonant gradation and medial lenition of stops (the pattern also continuing within the three families where gradation is found) is noted by Helimski (1995): an original allophonic gradation system between voiceless and voiced stops would have been easily disrupted by a spreading of voicing to previously unvoiced stops as well.

Based on phonological isoglosses, Häkkinen (2007) proposes the following tree:

Uralic

West Uralic

Saamic

Finnic

Mordvinic

Central Uralic (?)

Mari

Permic

East Uralic

Hungarian

Ob-Ugric

Mansi

Khanty

Samoyedic

Typology

Structural characteristics generally said to be typical of Uralic languages include:

Grammar

extensive use of independent suffixes (agglutination)

a large set of grammatical cases marked with agglutinative suffixes (13–14 cases on average; mainly later developments: Proto-Uralic is reconstructed with 6 cases), e.g.:

Erzya: 12 cases

Estonian: 14 cases (15 cases with instructive)

Finnish: 15 cases

Hungarian: 18 cases (together 34 grammatical cases and case-like suffixes)

Inari Sami: 9 cases

Komi: in certain dialects as many as 27 cases

Moksha: 13 cases

Nenets: 7 cases

North Sami: 6 cases

Udmurt: 16 cases

Veps: 24 cases

unique Uralic case system, from which all modern Uralic languages derive their case systems.

nominative singular has no case suffix.

accusative and genitive suffixes are nasal sounds (-n, -m, etc.)

three-way distinction in the local case system, with each set of local cases being divided into forms corresponding roughly to "from", "to", and "in/at"; especially evident, e.g. in Hungarian, Finnish and Estonian, which have several sets of local cases, such as the "inner", "outer" and "on top" systems in Hungarian, while in Finnish the "on top" forms have merged to the "outer" forms.

the Uralic locative suffix exists in all Uralic languages in various cases, e.g. Hungarian superessive, Finnish essive (-na), North Sami essive, Erzyan inessive, and Nenets locative.

the Uralic lative suffix exists in various cases in many Uralic languages, e.g. Hungarian illative, Finnish lative (-s as in rannemmas), Erzyan illative, Komi approximative, and Northern Sami locative.

a lack of grammatical gender, including one pronoun for both he and she; for example, hän in Finnish, tämä in Votic, tema in Estonian, sijə in Komi, ő in Hungarian.

negative verb, which exists in almost all Uralic languages

use of postpositions as opposed to prepositions (prepositions are uncommon).

possessive suffixes

the genitive is also used to express possession in some languages, e.g. Estonian mu koer, colloquial Finnish mun koira, Northern Sami mu beana 'my dog' (literally 'dog of me'). Separate possessive adjectives and possessive pronouns, such as my and your, are rare.

dual, in the Samoyedic, Ob-Ugric and Samic languages and reconstructed for Proto-Uralic

plural markers -j (i) and -t (-d, -q) have a common origin (e.g. in Finnish, Estonian, Võro, Erzya, Samic languages, Samoyedic languages). Hungarian, however, has -i- before the possessive suffixes and -k elsewhere. In the old orthographies, the plural marker -k was also used in the Samic languages.

Possessions are expressed by a possessor in the adessive or dative case, the verb "be" (the copula, instead of the verb "have") and the possessed with or without a possessive suffix. The grammatical subject of the sentence is thus the possessed. In Finnish, for example, the possessor is in the adessive case: "Minulla on kala", literally "At me is fish", i.e. "I have a fish", whereas in Hungarian, the possessor is in the dative case, but appears overtly only if it is contrastive, while the possessed has a possessive ending indicating the number and person of the possessor: "(Nekem) van egy halam", literally "(To me [dative]) is a fish-my", i.e. "(As for me,) I have a fish".

expressions that include a numeral are singular if they refer to things which form a single group, e.g. "négy csomó" in Hungarian, "njeallje čuolmma" in Northern Sami, "neli sõlme" in Estonian, and "neljä solmua" in Finnish, each of which means "four knots", but the literal approximation is "four knot". (This approximation is inaccurate for Finnish and Estonian, where the singular is in the partitive case, such that the number points to a part of a larger mass, like "four of knot(s)".)

Phonology

Vowel harmony: this is present in many but by no means all Uralic languages. It exists in Hungarian and various Baltic-Finnic languages, and is present to some degree elsewhere, such as in Mordvinic, Mari, Eastern Khanty, and Samoyedic. It is lacking in Sami, Permic and standard Estonian, while it does exist in Võro and elsewhere in South Estonian. (Although umlaut letters are used in writing Uralic languages, the languages do not exhibit Germanic umlaut; front and back values are intrinsic features of words and modify suffixes, not vice versa as in umlaut.)

Large vowel inventories. For example, some Selkup varieties have over twenty different monophthongs, and Estonian has over twenty different diphthongs.

Palatalization of consonants; in this context, palatalization means a secondary articulation, where the middle of the tongue is tense. For example, pairs like [ɲ] – [n], or [c] – [t] are contrasted in Hungarian, as in hattyú [hɒcːuː] "swan". Some Sami languages, for example Skolt Sami, distinguish three degrees: plain ⟨l⟩ [l], palatalized ⟨'l⟩ [lʲ], and palatal ⟨lj⟩ [ʎ], where ⟨'l⟩ has a primary alveolar articulation, while ⟨lj⟩ has a primary palatal articulation. Original Uralic palatalization is phonemic, independent of the following vowel and traceable to the millennia-old Proto-Uralic. It is different from Slavic palatalization, which is of more recent origin. The Finnic languages have lost palatalization, but the eastern varieties have reacquired it, so Finnic palatalization (where extant) was originally dependent on the following vowel and does not correlate to palatalization elsewhere in Uralic.

Lack of phonologically contrastive tone.

In many Uralic languages, the stress is always on the first syllable, though Nganasan shows (essentially) penultimate stress, and a number of languages of the central region (Erzya, Mari, Udmurt and Komi-Permyak) synchronically exhibit a lexical accent. The Erzya language can vary its stress in words to give specific nuances to sentential meaning.

Lexicography

Basic vocabulary of about 200 words, including body parts (e.g. eye, heart, head, foot, mouth), family members (e.g. father, mother-in-law), animals (e.g. viper, partridge, fish), nature objects (e.g. tree, stone, nest, water), basic verbs (e.g. live, fall, run, make, see, suck, go, die, swim, know), basic pronouns (e.g. who, what, we, you, I), numerals (e.g. two, five); derivatives increase the number of common words.

Selected cognates

The following is a very brief selection of cognates in basic vocabulary across the Uralic family, which may serve to give an idea of the sound changes involved. This is not a list of translations: cognates have a common origin, but their meaning may be shifted and loanwords may have replaced them.

Orthographical notes: The hacek denotes postalveolar articulation (⟨ž⟩ [ʒ], ⟨š⟩ [ʃ], ⟨č⟩ [t͡ʃ]) (In Northern Sami, (⟨ž⟩ [dʒ]), while the acute denotes a secondary palatal articulation (⟨ś⟩ [sʲ ~ ɕ], ⟨ć⟩ [tsʲ ~ tɕ], ⟨l⟩ [lʲ]) or, in Hungarian, vowel length. The Finnish letter ⟨y⟩ and the letter ⟨ü⟩ in other languages represent the high rounded vowel [y]; the letters ⟨ä⟩ and ⟨ö⟩ are the front vowels [æ] and [ø].

As is apparent from the list, Finnish is the most conservative of the Uralic languages presented here, with nearly half the words on the list below identical to their Proto-Uralic reconstructions and most of the remainder only having minor changes, such as the conflation of *ś into /s/, or widespread changes such as the loss of *x and alteration of *ï. Finnish has even preserved old Indo-European borrowings relatively unchanged as well. (An example is porsas ("pig"), loaned from Proto-Indo-European *porḱos or pre-Proto-Indo-Iranian *porśos, unchanged since loaning save for loss of palatalization, *ś > s.)

Mutual intelligibility

The Estonian philologist Mall Hellam proposed cognate sentences that she asserted to be mutually intelligible among the three most widely spoken Uralic languages: Finnish, Estonian, and Hungarian:

Estonian: 'Elav kala ujub vee all.

Finnish: 'Elävä kala ui veden alla.

Hungarian: 'Eleven hal úszik a víz alatt.

English: A live fish is swimming underwater.

However, linguist Geoffrey Pullum reports that neither Finns nor Hungarians could understand the other language's version of the sentence.

Comparison

No Uralic language exactly has the idealized typological profile of the family. Typological features with varying presence among the modern Uralic language groups include:

Notes:

Clearly present only in Nganasan.
Vowel harmony is present in the Uralic languages of Siberia only in some marginal archaic varieties: Nganasan, Southern Mansi and Eastern Khanty.
A number of umlaut processes are found in Livonian.
In Komi, but not in Udmurt.

Possible relations with other families

Many relationships between Uralic and other language families have been suggested, but none of these are generally accepted by linguists at the present time.

Indo-Uralic

The Indo-Uralic (or Uralo-Indo-European) hypothesis suggests that Uralic and Indo-European are related at a fairly close level or, in its stronger form, that they are more closely related than either is to any other language family. It is viewed as certain by a few linguists (see main article) and as possible by a larger number.

Uralic–Yukaghir

The Uralic–Yukaghir hypothesis identifies Uralic and Yukaghir as independent members of a single language family. It is currently widely accepted that the similarities between Uralic and Yukaghir languages are due to ancient contacts. Regardless, the hypothesis is accepted by a few linguists and viewed as attractive by a somewhat larger number.

Eskimo–Uralic

The Eskimo–Uralic hypothesis associates Uralic with the Eskimo–Aleut languages. This is an old thesis whose antecedents go back to the 18th century. An important restatement of it is Bergsland 1959.

Uralo-Siberian

Uralo-Siberian is an expanded form of the Eskimo–Uralic hypothesis. It associates Uralic with Yukaghir, Chukotko-Kamchatkan, and Eskimo–Aleut. It was propounded by Michael Fortescue in 1998.

Ural–Altaic

Theories proposing a close relationship with the Altaic languages were formerly popular, based on similarities in vocabulary as well as in grammatical and phonological features, in particular the similarities in the Uralic and Altaic pronouns and the presence of agglutination in both sets of languages, as well as vowel harmony in some. For example, the word for "language" is similar in Estonian (keel) and Mongolian (хэл (hel)). These theories are now generally rejected and most such similarities are attributed to language contact or coincidence.

Nostratic

Nostratic associates Uralic, Indo-European, Altaic, Dravidian, and various other language families of Asia. The Nostratic hypothesis was first propounded by Holger Pedersen in 1903 and subsequently revived by Vladislav Illich-Svitych and Aharon Dolgopolsky in the 1960s.

Eurasiatic

Eurasiatic resembles Nostratic in including Uralic, Indo-European, and Altaic, but differs from it in excluding the South Caucasian languages, Dravidian, and Afroasiatic and including Chukotko-Kamchatkan, Nivkh, Ainu, and Eskimo–Aleut. It was propounded by Joseph Greenberg in 2000–2002. Similar ideas had earlier been expressed by Heinrich Koppelmann (1933) and by Björn Collinder (1965:30–34).

Uralo-Dravidian

The hypothesis that the Dravidian languages display similarities with the Uralic language group, suggesting a prolonged period of contact in the past, is popular amongst Dravidian linguists and has been supported by a number of scholars, including Robert Caldwell, Thomas Burrow, Kamil Zvelebil, and Mikhail Andronov. This hypothesis has, however, been rejected by some specialists in Uralic languages, and has in recent times also been criticised by other Dravidian linguists, such as Bhadriraju Krishnamurti.

Uralic skepticism

In her book, The Uralic language family: facts, myths, and statistics, linguist Angela Marcantonio argues against the existence of the Uralic family, claiming that the languages are no more closely related to each other than they are to various other Eurasian languages.

All of these hypotheses are minority views at the present time in Uralic studies.

Other comparisons

Various unorthodox comparisons have been advanced such as Finno-Basque and Hungaro-Sumerian. These are considered spurious by specialists.

References

Uralic languages Wikipedia

(Text) CC BY-SA

Contents

Homeland

Early attestations

Uralic studies

Classification

Traditional classification

Lexical isoglosses

Phonological isoglosses

Typology

Grammar

Phonology

Lexicography

Selected cognates

Mutual intelligibility

Comparison

Possible relations with other families

Indo-Uralic

Uralic–Yukaghir

Eskimo–Uralic

Uralo-Siberian

Ural–Altaic

Nostratic

Eurasiatic

Uralo-Dravidian

Uralic skepticism

Other comparisons

References