Girish Mahajan (Editor)

Cork encoding

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

The Cork (also known as T1 or EC) encoding is a character encoding used for encoding glyphs in fonts. It is named after the city of Cork in Ireland, where during a TeX Users Group (TUG) conference in 1990 a new encoding was introduced for LaTeX. It contains 256 characters supporting most west and east-European languages with the Latin alphabet.

Contents

Details

In 8-bit TeX engines the font encoding has to match the encoding of hyphenation patterns where this encoding is most commonly used. In LaTeX one can switch to this encoding with \usepackage[T1]{fontenc}, while in ConTeXt MkII this is the default encoding already. In modern engines such as XeTeX and LuaTeX the Unicode is fully supported and the 8-bit font encodings are obsolete.

Supported languages

The encoding supports most European languages written in Latin alphabet. Notable exceptions are:

  • Esperanto (using IL3)
  • Latvian language and Lithuanian language (using L7X)
  • Welsh language
  • Languages with slightly suboptimal support include:

  • Galician language, Portuguese language and Spanish language – due to the lack of characters ª and º, which are not superscript versions of lower case “a” and “o” (superscripts are thinner) and they are often underlined
  • Croatian language, Bosnian language, Serbian language – due to the shared use of the slot for Đ
  • Turkish language – due to dotless i having different uppercase and lowercase combinations than in other languages
  • Romanian language - due to having S and T with cedilla
  • References

    Cork encoding Wikipedia


    Similar Topics