{"id":7317,"date":"2015-02-09T07:11:39","date_gmt":"2015-02-09T07:11:39","guid":{"rendered":"https:\/\/blogs.transparent.com\/russian\/?p=7317"},"modified":"2018-08-15T19:15:01","modified_gmt":"2018-08-15T19:15:01","slug":"digital-russian-how-to-view-and-input-russian-text","status":"publish","type":"post","link":"https:\/\/blogs.transparent.com\/russian\/digital-russian-how-to-view-and-input-russian-text\/","title":{"rendered":"Digital Russian: How To View and Input Russian Text"},"content":{"rendered":"<p>As we are irreversibly living in a digital age, it is no longer possible to stick to books and handwritten letters in learning and practicing languages. Russian is no exception to this, and even beginners soon encounter Russian websites, social networks, and other digital content. However, many learners, especially those from Anglophone countries, often have\u00a0limited experience with digital Russian. Awareness is half the battle, so let&#8217;s look at some of areas of potential trouble.<\/p>\n<h2>Character Encoding<\/h2>\n<p>If you are only using an English keyboard, you will rarely run into problems with\u00a0the correct display of text. However, readers of non-Latin scripts &#8212; like Russian &#8212; and to a lesser degree, Latin scripts with accented characters will have run into gibberish being displayed instead of text. This is due to an incorrect encoding (<strong>\u043a\u043e\u0434\u0438\u0440\u043e\u0301\u0432\u043a\u0430<\/strong>).<\/p>\n<p>Without going into too much technical detail, an <a href=\"http:\/\/en.wikipedia.org\/wiki\/Character_encoding\">encoding<\/a> matches up a sequence of bits, if we are talking about digital text, to a character. Historically, many programs, browsers, and user interfaces were created only with English in mind, using character sets (<strong>\u043d\u0430\u0431\u043e\u0301\u0440 \u0441\u0438\u0301\u043c\u0432\u043e\u043b\u043e\u0432<\/strong>) limited to English characters (see\u00a0<a href=\"http:\/\/en.wikipedia.org\/wiki\/ASCII\">ASCII<\/a>), or for\u00a0an <a href=\"http:\/\/en.wikipedia.org\/wiki\/Character_encoding#Common_character_encodings\">extended set of Latin characters<\/a> to accommodate various languages that used special and accented characters.<\/p>\n<p>Separate <a href=\"https:\/\/en.wikipedia.org\/wiki\/Code_page\">code pages<\/a>\u00a0were developed for Russian, like <a href=\"https:\/\/en.wikipedia.org\/wiki\/KOI_character_encodings\">KOI-8 <\/a>and <a href=\"https:\/\/ru.wikipedia.org\/wiki\/Windows-1251\">Windows-1251<\/a>. They had Russian characters where extended Latin code pages had accented characters. As a result, <span style=\"color: #800000\"><strong>using an incorrect encoding when trying to view text in Russian would <a style=\"color: #800000\" href=\"http:\/\/en.wikipedia.org\/wiki\/Mojibake#Russian_and_other_Cyrillic_alphabets\">scramble it completely<\/a>.<\/strong><\/span><\/p>\n<p>Luckily, with the advent of <a href=\"https:\/\/en.wikipedia.org\/wiki\/Unicode\">Unicode<\/a>, most of the world&#8217;s writing systems can fit in one code page, eliminating the need for separate code pages for each writing system. However, some websites and programs may still only use the local encoding, so beware!<\/p>\n<p>Here are some examples of code page mismatch for Russian text.<\/p>\n<div id=\"attachment_7409\" style=\"width: 610px\" class=\"wp-caption aligncenter post-item__attachment\"><a href=\"http:\/\/www.interfax.ru\/business\/422449\" aria-label=\"Interfax In Win 1251\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-7409\" class=\"wp-image-7409\"  alt=\"screenshot of Russian text\" width=\"600\" height=\"225\" \/ src=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1251.jpg\" srcset=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1251.jpg 751w, https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1251-350x131.jpg 350w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/a><p id=\"caption-attachment-7409\" class=\"wp-caption-text\">Screenshot from <a href=\"http:\/\/www.interfax.ru\/business\/422449\">interfax.ru<\/a> in the correct Win-1252 encoding<\/p><\/div>\n<div id=\"attachment_7410\" style=\"width: 610px\" class=\"wp-caption aligncenter post-item__attachment\"><a href=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_koi8.jpg\" aria-label=\"Interfax In Koi8\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-7410\" class=\"wp-image-7410\"  alt=\"wrong display of Russian text\" width=\"600\" height=\"243\" \/ src=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_koi8.jpg\" srcset=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_koi8.jpg 748w, https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_koi8-350x142.jpg 350w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/a><p id=\"caption-attachment-7410\" class=\"wp-caption-text\">Same page encoded in KOI-8<\/p><\/div>\n<div id=\"attachment_7411\" style=\"width: 610px\" class=\"wp-caption aligncenter post-item__attachment\"><a href=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1252.jpg\" aria-label=\"Interfax In Win 1252\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-7411\" class=\"wp-image-7411\"  alt=\"Same page encoded in win-1252\" width=\"600\" height=\"222\" \/ src=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1252.jpg\" srcset=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1252.jpg 750w, https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/interfax_in_win-1252-350x130.jpg 350w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/a><p id=\"caption-attachment-7411\" class=\"wp-caption-text\">Same page encoded in win-1252<\/p><\/div>\n<p>This <a href=\"http:\/\/www.w3.org\/International\/questions\/qa-what-is-encoding.en\">page<\/a> by the World Wide Web Consortium has an excellent, easy-to-understand explanation of encoding.<\/p>\n<h2>Fonts<\/h2>\n<p>A separate issue is having the proper Russian characters (technically, <a href=\"http:\/\/en.wikipedia.org\/wiki\/Glyph\">glyphs<\/a>) in the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Font\">font<\/a>\u00a0(<strong>\u0448\u0440\u0438\u0444\u0442<\/strong>) you are using. Even if the\u00a0encoding matches the sequence of bits to the correct Russian character, that character needs to have a proper graphic representation in the typeface.<\/p>\n<p>In case of word processors, <span style=\"color: #800000\"><strong>how Russian text is displayed depends on what fonts are installed on your computer<\/strong><\/span> and whether these fonts have Russian characters.\u00a0In my example below, all of the first line is displayed in\u00a0Minion Pro because that font has Russian glyphs on my computer. It may be different on yours. The second line, however, uses Gloucester MT Extra Condensed, which does not have Russian glyphs, so the Russian word\u00a0switched to a typeface that does support Russian &#8212; Times New Roman.<\/p>\n<div id=\"attachment_7413\" style=\"width: 340px\" class=\"wp-caption alignnone post-item__attachment\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-7413\" class=\"size-full wp-image-7413\" src=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/russian_font.jpg\" alt=\"The same text in two different fonts. Note how the Russian word defaulted to Times New Roman in the second line.\" width=\"330\" height=\"126\" \/><p id=\"caption-attachment-7413\" class=\"wp-caption-text\">The same text in two different fonts. Note how the Russian word defaulted to Times New Roman in the second line.<\/p><\/div>\n<p>If\u00a0the system is not able to substitute the font, it will display square boxes or question marks in place of Russian characters. If that happens to you, try saving your text in a more common font.<\/p>\n<h2>Keyboarding<\/h2>\n<p>Finally, I imagine many of you would like to start typing in Russian if you aren&#8217;t already. If you are still intimidated by typing in Russian, a <a href=\"http:\/\/winrus.com\/keyboard.htm\">virtual keyboard<\/a>\u00a0(<strong>\u0432\u0438\u0440\u0442\u0443\u0430\u0301\u043b\u044c\u043d\u0430\u044f \u043a\u043b\u0430\u0432\u0438\u0430\u0442\u0443\u0301\u0440\u0430<\/strong>) may be a good first step. However, <span style=\"color: #800000\"><strong>using your current keyboard to input Russian is easier than you think.<\/strong><\/span> Here are instructions for <a href=\"http:\/\/learnrussian.rt.com\/speak-russian\/how-install-cyrillic-keyboard-windows-7\">Windows 7<\/a> and <a href=\"http:\/\/learnrussian.rt.com\/speak-russian\/how-install-cyrillic-keyboard-mac\">Mac<\/a>. In Windows, the default keyboard shortcut for switching the input language is alt\u00a0+ shift for Windows (Mac&#8217;s a bit more <a href=\"http:\/\/apple.stackexchange.com\/questions\/15173\/changing-keyboard-input-language-on-mac-osx\">tricky<\/a>).<\/p>\n<p>If you think you may at some point stay in Russia, do a study abroad, or conduct business there, I would recommend getting used to the\u00a0<a href=\"http:\/\/en.wikipedia.org\/wiki\/Keyboard_layout#Russian\">standard Russian layout<\/a> <strong>(\u0441\u0442\u0430\u043d\u0434\u0430\u0301\u0440\u0442\u043d\u0430\u044f \u0440\u0430\u0441\u043a\u043b\u0430\u0301\u0434\u043a\u0430)<\/strong> from the get-go because a phonetic layout may not be readily available on computers you will be using in Russia. Don&#8217;t feel bad about taking a bit to find the letter on the keyboard &#8212; many Russian computer users type this way despite years of practice.<\/p>\n<p>More information about Russian keyboarding is available on the slightly dated but still relevant\u00a0<a href=\"http:\/\/winrus.com\/\">WinRus site<\/a> and, of course, you are encouraged to practice your skills through the Transparent Language <a href=\"https:\/\/blogs.transparent.com\/language-news\/2013\/04\/29\/new-alphabet-courses-in-transparent-language-online\/\">Alphabet Courses<\/a>.<\/p>\n<p>Happy reading and typing! I hope to see some comments in Russian to this post!<\/p>\n","protected":false},"excerpt":{"rendered":"<img width=\"330\" height=\"126\" src=\"https:\/\/blogs.transparent.com\/russian\/wp-content\/uploads\/sites\/10\/2015\/02\/russian_font.jpg\" class=\"attachment-post-thumbnail size-post-thumbnail wp-post-image\" alt=\"\" decoding=\"async\" loading=\"lazy\" \/><p>As we are irreversibly living in a digital age, it is no longer possible to stick to books and handwritten letters in learning and practicing languages. Russian is no exception to this, and even beginners soon encounter Russian websites, social networks, and other digital content. However, many learners, especially those from Anglophone countries, often have\u00a0limited&hellip;<\/p>\n<p class=\"post-item__readmore\"><a class=\"btn btn--md\" href=\"https:\/\/blogs.transparent.com\/russian\/digital-russian-how-to-view-and-input-russian-text\/\">Continue Reading<\/a><\/p>","protected":false},"author":113,"featured_media":7413,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":""},"categories":[1],"tags":[349772,349771],"class_list":["post-7317","post","type-post","status-publish","has-post-thumbnail","hentry","category-uncategorized","tag-computing","tag-typing-in-russian"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/posts\/7317","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/users\/113"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/comments?post=7317"}],"version-history":[{"count":14,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/posts\/7317\/revisions"}],"predecessor-version":[{"id":11079,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/posts\/7317\/revisions\/11079"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/media\/7413"}],"wp:attachment":[{"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/media?parent=7317"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/categories?post=7317"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.transparent.com\/russian\/wp-json\/wp\/v2\/tags?post=7317"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}