Central, Eastern, and Northern Europe, North Asia
One of the world's primary language families
Finnic Hungarian Khanty Mansi Mari Mordvinic Permic Sami Samoyedic
The Uralic languages /jʊˈrælᵻk/ (sometimes called Uralian languages /jʊˈreɪliən/) constitute a language family of 38 languages spoken by approximately 25 million people, predominantly in Northern Eurasia. The Uralic languages with the most native speakers are Hungarian, Finnish, and Estonian, which are official languages of Hungary, Finland, and Estonia, respectively, and of the European Union. Other Uralic languages with significant numbers of speakers are Erzya, Moksha, Mari, Udmurt, and Komi, which are officially recognized languages in various regions of Russia.
- Early attestations
- Uralic studies
- Traditional classification
- Lexical isoglosses
- Phonological isoglosses
- Selected cognates
- Mutual intelligibility
- Possible relations with other families
- Indo Uralic
- Uralo Siberian
- Uralo Dravidian
- Uralic skepticism
- Other comparisons
The name "Uralic" derives from the fact that areas where the languages are spoken spread on both sides of the Ural Mountains. Also, the original homeland (Urheimat) is commonly hypothesized to lie in the vicinity of the Urals.
Finno-Ugric is sometimes used as a synonym for Uralic, though Finno-Ugric is widely understood to exclude the Samoyedic languages. Scholars who do not accept the traditional notion that Samoyedic split first from the rest of the Uralic family, such as Tapani Salminen, may treat both terms as synonymous.
In recent times, linguists often place the Urheimat (original homeland) of the Proto-Uralic language in the vicinity of the Volga River, west of the Urals, close to the Urheimat of the Indo-European languages, or to the east and southeast of the Urals. Gyula László places its origin in the forest zone between the Oka River and central Poland. E. N. Setälä and M. Zsirai place it between the Volga and Kama Rivers. According to E. Itkonen, the ancestral area extended to the Baltic Sea. P. Hajdu has suggested a homeland in western and northwestern Siberia.
The first plausible mention of a Uralic people is in Tacitus's Germania (c. 98 AD), mentioning the Fenni (usually interpreted as referring to the Sami) and two other possibly Uralic tribes living in the farthest reaches of Scandinavia. There are many possible earlier mentions, including the Iyrcae (perhaps related to Yugra) described by Herodotus living in what is now European Russia, and the Budini, described by Herodotus as notably red-haired (a characteristic feature of the Udmurts) and living in northeast Ukraine and/or adjacent parts of Russia. In the late 15th century, European scholars noted the resemblance of the names Hungaria and Yugria, the names of settlements east of the Ural. They assumed a connection but did not seek linguistic evidence.
The affinity of Hungarian and Finnish was first proposed in the late 17th century. Two of the more important contributors were the German scholar Martin Vogel, who established several grammatical and lexical parallels between Finnish and Hungarian, and the Swedish scholar Georg Stiernhielm, who commented on the similarities of Sami, Estonian and Finnish, and also on a few similar words between Finnish and Hungarian. These two authors were thus the first to outline what was to become the classification of the Finno-Ugric, and later Uralic family. This proposal received some of its initial impetus from the fact that these languages, unlike most of the other languages spoken in Europe, are not part of what is now known as the Indo-European family.
In 1717, Swedish professor Olof Rudbeck proposed about 100 etymologies connecting Finnish and Hungarian, of which about 40 are still considered valid. In the same year, the German scholar Johann Georg von Eckhart, in an essay published in Leibniz's Collectanea Etymologica, proposed for the first time a relation to the Samoyedic languages.
Philip Johan von Strahlenberg in 1730 published his book Das Nord- und Ostliche Theil von Europa und Asia, surveying the geography, peoples and languages of Russia. All the main groups of the Uralic languages were already identified here. Nonetheless, these relationships were not widely accepted. Hungarian intellectuals especially were not interested in the theory and preferred to assume connections with Turkic tribes, an attitude characterized by Ruhlen (1987) as due to "the wild unfettered Romanticism of the epoch". Still, in spite of this hostile climate, the Hungarian Jesuit János Sajnovics travelled with Maximilian Hell to survey the alleged relationship between Hungarian and Sami. Sajnovics published his results in 1770, arguing for a relationship based on several grammatical features. In 1799, the Hungarian Sámuel Gyarmathi published the most complete work on Finno-Ugric to that date.
At the beginning of the 19th century, research on this family was thus more advanced than Indo-European research. But the rise of Indo-European comparative linguistics absorbed so much attention and enthusiasm that Uralic linguistics was all but eclipsed in Europe; in Hungary, the only European country that would have had a vested interest in the family (Finland and Estonia being under Russian rule), the political climate was too hostile for the development of Uralic comparative linguistics.
Progress resumed after a number of decades with the first major field research expeditions on the Uralic languages spoken in the more eastern parts of Russia. These were made in the 1840s by Matthias Castrén (1813–1852) and Antal Reguly (1819–1858), who focused especially on the Samoyedic and the Ob-Ugric languages, respectively. Reguly's materials were worked on by the Hungarian linguist Pál Hunfalvy (1810–1891) and German Josef Budenz (1836–1892), who both supported the Uralic affinity of Hungarian. Budenz was the first scholar to bring this result to popular consciousness in Hungary, and to attempt a reconstruction of the Proto-Finno-Ugric grammar and lexicon. Another late-19th-century Hungarian contribution is that of Ignácz Halász (1855–1901), who published extensive comparative material of Finno-Ugric and Samoyedic in the 1890s, and whose work is at the base of today's wide acceptance of the inclusion of Samoyedic as a part of Uralic. Meanwhile, in the autonomous Grand Duchy of Finland, a chair for Finnish language and linguistics at the University of Helsinki was created in 1850, first held by Castrén.
In 1883 the Finno-Ugrian Society was founded in Helsinki on the proposal of Otto Donner, and during the late 19th and early 20th century (until the separation of Finland from Russia following the Russian revolution), a large number of stipendiates were sent by the Society to survey the less known Uralic languages. Major researchers of this period included Heikki Paasonen (studying especially the Mordvinic languages), Yrjö Wichmann (fi) (Permic), Artturi Kannisto (fi) (Mansi), Kustaa Fredrik Karjalainen (Khanty), Toivo Lehtisalo (Nenets), and Kai Donner (Kamass). The vast amounts of data collected on these expeditions would provide edition work for later generations of Finnish Uralicists for more than a century.
The Uralic family comprises nine undisputed groups with no consensus classification between them. (Some of the proposals are listed in the next section.) An agnostic approach treats them as separate branches.
Obsolete or native names are displayed in italics.
There is also historical evidence of a number of extinct languages of uncertain affiliation:
Traces of Finno-Ugric substrata, especially in toponymy, in the northern part of European Russia have been proposed as evidence for even more extinct Uralic languages.
All Uralic languages are thought to have descended, through independent processes of language change, from Proto-Uralic. The internal structure of the Uralic family has been debated since the family was first proposed. Doubts about the validity of most of the proposed higher-order branchings (grouping the nine undisputed families) are becoming more common.
A traditional classification of the Uralic languages has existed since the late 19th century, tracing back to Donner (1879). It has enjoyed frequent adaptation in whole or in part in encyclopedias, handbooks, and overviews of the Uralic family. Donner's model is as follows:
At Donner's time, the Samoyedic languages were still poorly known, and he was not able to address their position. As they became better known in the early 20th century, they were found to be quite divergent, and they were assumed to have separated already early on. The terminology adopted for this was "Uralic" for the entire family, "Finno-Ugric" for the non-Samoyedic languages (though "Finno-Ugric" has, to this day, remained in use also as a synonym for the whole family). Finno-Ugric and Samoyedic are listed in ISO 639-5 as primary branches of Uralic.
Nodes of the traditional family tree recognized in some overview sources:
- Hajdú describes the Ugric and Volgaic groups as areal units.
- Austerlitz however accepts narrower-than-traditional Finno-Ugric and Finno-Permic groups that exclude Samic.
Little explicit evidence has however been presented in favor of Donner's model since his original proposal, and numerous alternate schemes have been proposed. Especially in Finland there has been a growing tendency to reject the Finno-Ugric intermediate protolanguage. A recent competing proposal instead unites Ugric and Samoyedic in an "East Uralic" group for which shared innovations can be noted.
The Finno-Permic grouping still holds some support, though the arrangement of its subgroups is a matter of some dispute. Mordvinic is commonly seen as particularly closely related to or part of Finno-Samic. The term Volgaic (or Volga-Finnic) was used to denote a branch previously believed to include Mari, Mordvinic and a number of the extinct languages, but it is now obsolete and considered a geographic classification rather than a linguistic one.
Within Ugric, uniting Mansi with Hungarian rather than Khanty has been a competing hypothesis to Ob-Ugric.
Lexicostatistics has been used in defense of the traditional family tree. A recent re-evaluation of the evidence however fails to find support for Finno-Ugric and Ugric, suggesting four lexically distinct branches (Finno-Permic, Hungarian, Ob-Ugric and Samoyedic).
One alternate proposal for a family tree, with emphasis on the development of numerals, is as follows:
Another proposed tree, more divergent from the standard, focusing on consonant isoglosses (which does not consider the position of the Samoyedic languages) is presented by Viitso (1997), and refined in Viitso (2000):
The grouping of the four bottom-level branches remains to some degree open to interpretation, with competing models of Finno-Saamic vs. Eastern Finno-Ugric (Mari, Mordvinic, Permic-Ugric; *k > ɣ between vowels, degemination of stops) and Finno-Volgaic (Finno-Saamic, Mari, Mordvinic; *δ́ > δ between vowels) vs. Permic-Ugric. Viitso finds no evidence for a Finno-Permic grouping.
Extending this approach to cover the Samoyedic languages suggests affinity with Ugric, resulting in the aforementioned East Uralic grouping, as it also shares the same sibilant developments. A further non-trivial Ugric-Samoyedic isogloss is the reduction *k, *x, *w > ɣ when before *i, and after a vowel (cf. *k > ɣ above), or adjacent to *t, *s, *š, or *ś.
Finno-Ugric consonant developments after Viitso (2000); Samoyedic changes after Sammallahti (1988)
The inverse relationship between consonant gradation and medial lenition of stops (the pattern also continuing within the three families where gradation is found) is noted by Helimski (1995): an original allophonic gradation system between voiceless and voiced stops would have been easily disrupted by a spreading of voicing to previously unvoiced stops as well.
Based on phonological isoglosses, Häkkinen (2007) proposes the following tree:
Structural characteristics generally said to be typical of Uralic languages include:
Basic vocabulary of about 200 words, including body parts (e.g. eye, heart, head, foot, mouth), family members (e.g. father, mother-in-law), animals (e.g. viper, partridge, fish), nature objects (e.g. tree, stone, nest, water), basic verbs (e.g. live, fall, run, make, see, suck, go, die, swim, know), basic pronouns (e.g. who, what, we, you, I), numerals (e.g. two, five); derivatives increase the number of common words.
The following is a very brief selection of cognates in basic vocabulary across the Uralic family, which may serve to give an idea of the sound changes involved. This is not a list of translations: cognates have a common origin, but their meaning may be shifted and loanwords may have replaced them.
Orthographical notes: The hacek denotes postalveolar articulation (⟨ž⟩ [ʒ], ⟨š⟩ [ʃ], ⟨č⟩ [t͡ʃ]) (In Northern Sami, (⟨ž⟩ [dʒ]), while the acute denotes a secondary palatal articulation (⟨ś⟩ [sʲ ~ ɕ], ⟨ć⟩ [tsʲ ~ tɕ], ⟨l⟩ [lʲ]) or, in Hungarian, vowel length. The Finnish letter ⟨y⟩ and the letter ⟨ü⟩ in other languages represent the high rounded vowel [y]; the letters ⟨ä⟩ and ⟨ö⟩ are the front vowels [æ] and [ø].
As is apparent from the list, Finnish is the most conservative of the Uralic languages presented here, with nearly half the words on the list below identical to their Proto-Uralic reconstructions and most of the remainder only having minor changes, such as the conflation of *ś into /s/, or widespread changes such as the loss of *x and alteration of *ï. Finnish has even preserved old Indo-European borrowings relatively unchanged as well. (An example is porsas ("pig"), loaned from Proto-Indo-European *porḱos or pre-Proto-Indo-Iranian *porśos, unchanged since loaning save for loss of palatalization, *ś > s.)
The Estonian philologist Mall Hellam proposed cognate sentences that she asserted to be mutually intelligible among the three most widely spoken Uralic languages: Finnish, Estonian, and Hungarian:
However, linguist Geoffrey Pullum reports that neither Finns nor Hungarians could understand the other language's version of the sentence.
No Uralic language exactly has the idealized typological profile of the family. Typological features with varying presence among the modern Uralic language groups include:
- Clearly present only in Nganasan.
- Vowel harmony is present in the Uralic languages of Siberia only in some marginal archaic varieties: Nganasan, Southern Mansi and Eastern Khanty.
- A number of umlaut processes are found in Livonian.
- In Komi, but not in Udmurt.
Possible relations with other families
Many relationships between Uralic and other language families have been suggested, but none of these are generally accepted by linguists at the present time.
The Indo-Uralic (or Uralo-Indo-European) hypothesis suggests that Uralic and Indo-European are related at a fairly close level or, in its stronger form, that they are more closely related than either is to any other language family. It is viewed as certain by a few linguists (see main article) and as possible by a larger number.
The Uralic–Yukaghir hypothesis identifies Uralic and Yukaghir as independent members of a single language family. It is currently widely accepted that the similarities between Uralic and Yukaghir languages are due to ancient contacts. Regardless, the hypothesis is accepted by a few linguists and viewed as attractive by a somewhat larger number.
The Eskimo–Uralic hypothesis associates Uralic with the Eskimo–Aleut languages. This is an old thesis whose antecedents go back to the 18th century. An important restatement of it is Bergsland 1959.
Uralo-Siberian is an expanded form of the Eskimo–Uralic hypothesis. It associates Uralic with Yukaghir, Chukotko-Kamchatkan, and Eskimo–Aleut. It was propounded by Michael Fortescue in 1998.
Theories proposing a close relationship with the Altaic languages were formerly popular, based on similarities in vocabulary as well as in grammatical and phonological features, in particular the similarities in the Uralic and Altaic pronouns and the presence of agglutination in both sets of languages, as well as vowel harmony in some. For example, the word for "language" is similar in Estonian (keel) and Mongolian (хэл (hel)). These theories are now generally rejected and most such similarities are attributed to language contact or coincidence.
Nostratic associates Uralic, Indo-European, Altaic, Dravidian, and various other language families of Asia. The Nostratic hypothesis was first propounded by Holger Pedersen in 1903 and subsequently revived by Vladislav Illich-Svitych and Aharon Dolgopolsky in the 1960s.
Eurasiatic resembles Nostratic in including Uralic, Indo-European, and Altaic, but differs from it in excluding the South Caucasian languages, Dravidian, and Afroasiatic and including Chukotko-Kamchatkan, Nivkh, Ainu, and Eskimo–Aleut. It was propounded by Joseph Greenberg in 2000–2002. Similar ideas had earlier been expressed by Heinrich Koppelmann (1933) and by Björn Collinder (1965:30–34).
The hypothesis that the Dravidian languages display similarities with the Uralic language group, suggesting a prolonged period of contact in the past, is popular amongst Dravidian linguists and has been supported by a number of scholars, including Robert Caldwell, Thomas Burrow, Kamil Zvelebil, and Mikhail Andronov. This hypothesis has, however, been rejected by some specialists in Uralic languages, and has in recent times also been criticised by other Dravidian linguists, such as Bhadriraju Krishnamurti.
In her book, The Uralic language family: facts, myths, and statistics, linguist Angela Marcantonio argues against the existence of the Uralic family, claiming that the languages are no more closely related to each other than they are to various other Eurasian languages.
All of these hypotheses are minority views at the present time in Uralic studies.
Various unorthodox comparisons have been advanced such as Finno-Basque and Hungaro-Sumerian. These are considered spurious by specialists.