Neha Patil (Editor)

Tatar language

Updated on
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Ethnicity  Volga Tatars
Official language in  Russia  Tatarstan
Native to  Russia, other post-Soviet states
Native speakers  6.5 million (2015) (may include some L2 speakers)
Language family  Turkic Common Turkic Kipchak Kipchak–Bulgar Tatar
Writing system  Tatar alphabet (Arabic, Cyrillic, Latin)

The Tatar language (Tatar: татар теле; татарча, tatar tele, tatarça; تاتار تلی or طاطار تيلي) is a Turkic language spoken by Volga Tatars mainly located in modern Tatarstan, Bashkortostan and Nizhny Novgorod Oblast. It should not be confused with the Crimean Tatar language, to which it is remotely related but with which it is not mutually intelligible.


Geographic distribution

The Tatar language is spoken in Russia (about 5.3 million people), Ukraine, China, Finland, Turkey, Uzbekistan, the United States of America, Romania, Azerbaijan, Israel, Kazakhstan, Georgia, Lithuania, Latvia, and other countries. There are more than 7 million speakers of Tatar in the world.

Tatar is also native for several thousand Maris. Mordva's Qaratay group also speak a variant of Kazan Tatar.

In the 2010 census, 69% of Russian Tatars who responded to the question about language ability claimed a knowledge of the Tatar language. In Tatarstan, 93% of Tatars and 3,6% of Russians did so. In neighbouring Bashkortostan, 67% of Tatars, 27% of Bashkirs, and 1,3% of Russians did.

Official status

Tatar, along with Russian, is the official language of the Republic of Tatarstan. The official script of Tatar language is based on the Cyrillic script with some additional letters. The Republic of Tatarstan passed a law in 1999, which came into force in 2001, establishing an official Tatar Latin alphabet. A Russian federal law overrode it in 2002, making Cyrillic the sole official script in Tatarstan since. Unofficially, other scripts are used as well, mostly Latin and Arabic. All official sources in Tatarstan must use Cyrillic on their websites and in publishing. In other cases, where Tatar has no official status, the use of a specific alphabet depends on the preference of the author.

The Tatar language was made a de facto official language in Russia in 1917, but only in the Tatar Autonomous Soviet Socialist Republic. Tatar is also considered to have been the official language in the short-lived Idel-Ural State, briefly formed during the Russian Civil War. One should note, however, that Bolshevist Russia did not recognize official languages as such; however, there were a number of languages that could be used in trial in some republics. In the Soviet era, Tatar was such a language in Bashkortostan, Mari El and other regions of the Russian SFSR.

The usage of Tatar declined from the 1930s onwards. In the 1980s, the study and teaching of Tatar in the public education system was limited to rural schools. However, Tatar-speaking pupils had little chance of entering university because higher education was available in Russian almost exclusively.

As of 2001 Tatar was considered a potentially endangered language while Siberian and Crimean Tatar languages received "endangered" and "seriously endangered" statuses, respectively. Higher education in Tatar can only be found in Tatarstan, and is restricted to the humanities. In other regions Tatar is primarily a spoken language and the number of speakers as well as their proficiency tends to decrease. Tatar is popular as a written language only in Tatar-speaking areas where schools with Tatar language lessons are situated. On the other hand, Tatar is the only language in use in rural districts of Tatarstan.

Dialects of Tatar

There are three main dialects of Tatar: Western (Mişär or Mishar), Middle (Kazan), and Eastern (Siberian). All of these dialects also have subdivisions. Significant contribution to the study of the Tatar language and its dialects, were made by a scientist Gabdulkhay Akhatov, who is considered to be the founder of the modern Tatar dialectological school.


In the Western (Mişär) dialect Ç is pronounced [tʃ] (southern or Lambir Mişärs) and as [ts] (northern Mişärs or Nizhgars). C is pronounced [dʒ]. There are no differences between v and w, q and k, g and ğ in the Mişär dialect. (The Cyrillic alphabet doesn't have special letters for q, ğ and w, so Mişär speakers have no difficulty reading Tatar written in Cyrillic.)

This is the dialect spoken by the Tatar minority of Finland.



In the Minzälä subdialect of the Middle Dialect z is pronounced [ð], as opposed to other dialects where it is silent.


In bilingual cities people often pronounce h as [x], q as [k], ğ as [ɡ], w as [v]. This could be due to Russian influence. Another possibility is that these cities were places where both the western and middle dialects were used.

The influence of Russian is significant. Russian words and phrases are used with Tatar grammar or Russian grammar in Tatar texts. Some Russian verbs are taken entirely, un-nativized, and followed with itärgä. Some English words and phrases are also used.


The Yaña Bistä slang, Yaña Bistä gäbe or simply Gäp was a distinct cryptolect of the Tatar language, spoken in Yaña Bistä (The New Quarter) of Kazan. It has been extinct or near extinct since the 1920s. The vocabulary and grammar of this sociolect differ from those of standard Tatar. The vocabulary includes some words from Central Asian languages. Modern Tatar slang is also sometimes known as gäp. Gäp is phonetically equivalent to the English word gap and is Standard Tatar for a talk.

Siberian Tatar

Siberian Tatars pronounce ç as [ts], c as [j] and sometimes b as [p], d as [t], f as p, j as ch, t as d, z as s and h as k. There are also grammatical differences within the dialect, scattered across Siberia.

Many linguists claim the origins of Siberian Tatar dialects are actually independent of Volga–Ural Tatar; these dialects are quite remote both from Standard Tatar and from each other, often preventing mutual comprehension. The claim that this language is part of the modern Tatar language is typically supported by linguists in Kazan and denounced by Siberian Tatars.

Over time, some of these dialects were given distinct names and recognized as separate languages (e.g. the Chulym language) after detailed linguistic study. A brief linguistic analysis shows that many of these dialects exhibit features which are quite different from the Volga–Ural Tatar varieties, and should be classified as Turkic varieties belonging to several sub-groups of the Turkic languages, distinct from Kipchak languages to which Volga–Ural Tatar belongs.

By studying the phonetic peculiarities of the dialect of the local population of Siberia, Professor Gabdulkhay Akhatov first among the scientists discovered in the Speech of the Siberian Tatars is such a thing as 'clip-clop', which in his opinion, was obtained for the Siberian Tatars of Kipchaks. In his classic fundamental research work "Dialect of the West Siberian Tatars" (1963) Gabdulkhay Akhatov wrote about a territorial resettlement of the Tobol-Irtysh Tatars Tyumen and Omsk areas. Subjecting a comprehensive integrated analysis of the phonetic system, the lexical composition and grammatical structure, the scientist concluded that the language of the Siberian Tatars is a separate language, it is divided into three dialects and it is one of the most ancient Turkic languages.


Phonemically, Tatar may be argued to have two vowel heights, high and low. There are two low vowels, front and back, while there are eight high vowels: front and back, round and unround, long and short. However, phonetically, the short high vowels are reduced: they are mid-centralized. They are therefore generally transcribed with mid vowel letters such as e and o: high front i ü, high back ï u, reduced (mid) front e ö, reduced (mid) back ë o, and low ä, a. The high back unrounded vowel ï is only found in Russian loans, though the native diphthong ëy, which only occurs word-finally, has been argued to be phonemically ï. Loaned vowels are considered to be back vowels.

Phonetically, the native vowels are approximately high и/i [i], ү/ü [ʉ], у/u [u], reduced е (э)/e [ɘ̆], ө/ө [ɵ̆], ы/ı [ɤ̆~ʌ̆] о/o [ŏ] (ë may be mid-low), and low ә/ə [a~æ], а/а [ɑ]. In polysyllabic words, the front-back distinction is lost in reduced vowels: all become mid-central. Reduced vowels in unstressed position are frequently elided. Low back /ɑ/ is rounded [ɒ] word-initially and after [ɒ], as in bala 'child'. In Russian loans there are also [ɨ], [ɛ], and [ɔ]

Historically, the Turkic high vowels have become the Tatar reduced series, whereas the Turkic mid vowels have replaced them. Thus Kazakh til 'language' and kün 'day' correspond to Tatar tel and kön, while Kazakh men 'I', qol 'hand', and kök 'sky' are in Tatar min, qul, kük.


Most of these phonemes are common to or have equivalents in all Turkic languages, but the phonemes /v/, /ts/, /h/ and /ʒ/ are only found in loanwords in Literary Tatar. /f/ is also of foreign origin, but is also found in native words, e.g. yafraq "leaf".

Pronunciation of loanwords

While the consonants [ʒ], [f] and [v] are not native to Tatar, they are well established. However, Tatars usually substitute fricatives for affricates, for example [ɕ] for [tʃ], [ʒ] or [ʑ] for [dʒ], and [s] for [ts]. Nevertheless, literary traditions recommend the pronunciation of affricates in loanwords.

[ʔ] (hamza) is a sound found in Arabic loanwords and Islamic prayers. It is usually pronounced [e] in loanwords.


Palatalisation is not common in Tatar. As a result, speakers have no problem using the Arabic and Jaꞑalif scripts, neither of which has an accepted method for indicating palatalisation.

In general, Russian words with palatalisation have entered into the speech of bilingual Tatars since the 1930s. When writing in the Cyrillic alphabet, Russian words are spelled as they are in Russian. In today's Latin orthography, palatalisation is sometimes represented by an acute diacritic under the vowel.

Some Tatars speak Russian without palatalisation, which is known as a Tatar accent.

Syllable types

  • V (ı-lıs, u-ra, ö-rä)
  • VC (at-law, el-geç, ir-kä)
  • CV (qa-la, ki-ä, su-la)
  • CVC (bar-sa, sız-law, köç-le, qoş-çıq)
  • VCC (ant-lar, äyt-te, ilt-kän)
  • CVCC (tört-te, qart-lar, 'qayt-qan)
  • Stress is on the final syllable.

    Phonetic replacement

    Tatar phonotactics dictate many pronunciation changes.

    Unrounded vowels may be pronounced as rounded after o or ö:

    qorı /qoro/
    borın /boron/
    közge /közgö/
    sorı /soro/)

    Nasals are assimilated to following stops:

    unber /umber/
    mengeç /meñgeç/

    Voicing may also undergo assimilation:

    küzsez /küssez/

    Unstressed vowels may be syncopated or reduced:

    urını /urnı/
    kilene /kilne/
    bezne /bĕzne/
    kerdem /kĕrdem/
    qırğıç /qĭrğıç/

    Vowels may also be elided:

    qara urman /qar'urman/
    kilä ide /kilä'yde/
    turı uram /tur'uram/
    bula almím /bul'almím/

    In consonant clusters longer than two phones, ı or e (whichever is dictated by vowel harmony) is inserted into speech as an epenthetic vowel.

    tekst → /tekest/
    bank → /banık/ (not /bañk/)

    Final devoicing is also frequent:

    tabíb (doctor) → [tabíp]


    Like other Turkic languages, Tatar is an agglutinative language.

    Grammatical case:

  • Nominative -
  • Accusative: -ны/-не, -н
  • Possessive: -ның/-нең
  • Dative: -га/-гә, -ка/-кә, -а/-ә, -на/-нә
  • Locative: -да/-дә, -та/-тә, -нда/-ндә
  • Ablative: -дан/-дән, -тан/-тән, -нан/-нән, -ннан/-ннән
  • Plural:

  • Nominative: -лар/-ләр, -нар/-нәр
  • Accusative: -ларны
  • Possessive: -ларның
  • Dative: -ларга
  • Locative: -ларда
  • Ablative: -лардан
  • Plural

  • After vowels, consonants, hard: -lar (bala-lar, abí-lar, kitap-lar, qaz-lar, malay-lar, qar-lar, ağaç-lar)
  • After vowels, consonants, soft: -lär (äni-lär, sölge-lär, däftär-lär, kibet-lär, süz-lär, bäbkä-lär, mäktäp-lär, xäref-lär)
  • After nasals, hard: -nar (uram-nar, urman-nar, tolım-nar, moñ-nar, tañ-nar, şalqan-nar)
  • After nasals, soft: -när (ülän-när, keläm-när, çräm-när, iñ-när, ciñ-när, isem-när)
  • Writing system

    Tatar has been written in a number of different alphabets.

    Writing was adopted from the Bolgar language, which used the Orkhon script, before the 920s. Later, the Arabic script was also used, as well as Latin and Cyrillic.

    Before 1928, Tatar was written with an Arabic alphabet (İske imlâ to 1920; Yaña imlâ 1920–1928).

    In the Soviet Union after 1928, Tatar was written with a Latin alphabet called Jaꞑalif.

    In 1939, in Tatarstan (a republic of Russia where Tatar is most commonly used) and all other parts of the Soviet Union a Cyrillic script was developed and is still used to write Tatar. It is also used in Kazakhstan.

    The Republic of Tatarstan passed a law in 1999 that came into force in 2001 establishing an official Tatar Latin alphabet. A Russian federal law overrode it in 2002, making Cyrillic the sole official script in Tatarstan since. In 2004, an attempt to introduce a Latin-based alphabet for Tatar was further abandoned when the Constitutional Court ruled that the federal law of the 15th of November 2002 mandating the use of Cyrillic for the state languages of the republics of the Russian Federation does not contradict the Russian constitution. In accordance with this Constitutional Court ruling, on 28 December 2004, the Tatar Supreme Court overturned the Tatarstani law that made the Latin alphabet official.

    In China, Tatars still use the Arabic script.

  • Tatar Cyrillic alphabet (letter order adopted in 1997):
  • 1999 Tatar Latin alphabet, made official by a law adopted by Tatarstani authorities but annulled by the Tatar Supreme Court in 2004:
  • History

    Tatar's ancestors are the extinct Bulgar and Kipchak languages. Crimean Tatar, although another Kipchak language, is more akin to standard Turkish due to influence from it.

    The literary Tatar language is based on the Middle Tatar dialect and on the Old Tatar language (İske Tatar Tele). Both are members of the Kipchak group of Turkic languages, although they also partly derive from the ancient Volga Bulgar language.

    Most of the Uralic languages in the Volga River area have strongly influenced the Tatar language, as have the Arabic, Persian and Russian languages.


  • әйе - äye – yes
  • юк - yuq – no
  • исәнме(сез)/саумы(сыз) - isänme(sez)/sawmı(sız) – hello
  • сәлам - sälam – hi
  • сау бул(ыгыз)/хуш(ыгыз) - saw bul(ığız)/xuş(ığız) – bye bye
  • зинһар өчен - zinhar öçen – please
  • рәхмәт - räxmät – thank you
  • гәфу ит(егез) - ğafu it(egez) – excuse me
  • мин - min – I
  • син - sin – you (sg.)
  • сез - sez – you (pl.)
  • ул - ul – he / she / it
  • без - bez – we
  • алар - alar – they
  • милләт - millät – nation
  • Инглизчә - İnglizçä – English
  • References

    Tatar language Wikipedia

    Similar Topics
    Assia and the Hen with the Golden Eggs
    Clifford Aboagye
    Tyler Smith (basketball)