{"id":3566,"date":"2015-09-02T07:58:15","date_gmt":"2015-09-02T11:58:15","guid":{"rendered":"https:\/\/blogs.transparent.com\/language-news\/?p=3566"},"modified":"2020-10-01T14:45:40","modified_gmt":"2020-10-01T18:45:40","slug":"how-google-translate-works-and-why-it-doesnt-measure-up","status":"publish","type":"post","link":"https:\/\/blogs.transparent.com\/language-news\/2015\/09\/02\/how-google-translate-works-and-why-it-doesnt-measure-up\/","title":{"rendered":"How Google Translate Works, and Why It Doesn\u2019t Measure Up"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">With over 200 million daily translations, there\u2019s no denying that Google Translate is a wildly popular translation service. Indeed, machine translation has come very far since its infancy in the early 2000s. Instead of translating words at face value, machine translators have developed complex algorithms to deliver more accurate translations, and some even take into account colloquial language and idioms. Still, the very nature of machine translators prevents them from ever doing a human\u2019s job. Let\u2019s take a look at how machine translators (such as Google Translate) work, what their limitations are, and why they can\u2019t replace the quintessential human touch.<\/span><\/p>\n<div id=\"attachment_7177\" style=\"width: 970px\" class=\"wp-caption aligncenter post-item__attachment\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-7177\" class=\"wp-image-7177 size-full\" src=\"https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720.png\" alt=\"machine translation\" width=\"960\" height=\"528\" srcset=\"https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720.png 960w, https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720-350x193.png 350w, https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720-768x422.png 768w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/><p id=\"caption-attachment-7177\" class=\"wp-caption-text\">Image via Pixabay<\/p><\/div>\n<p><b>How machine translation works<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Google Translate, as well as other machine translators, operate on statistics rather than rules. That is, they look for patterns in hundreds of millions of documents that have already been translated by human translators. Google Translate makes special use of UN documents, which are translated in all six official UN languages, and thus provide ample linguistic data. This way, they can weigh a plethora of options for phrases presented by various different (human) translations, and select an educated guess based on the one that occurs most frequently. For example, they detect that, in Spanish, the phrase \u201cdarse cuenta\u201d is usually translated as \u201crealize\u201d in English. Therefore, based on statistics, Google Translate will correctly translate the phrase as \u201crealize\u201d, rather than a word-for-word translation, which would appear more like \u201cgive account\u201d.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Finding linguistic data large enough to create legitimate statistical analyses is no easy feat. Given that more documents are available in English than in any other language, the data almost always uses English as an intermediary step when translating between two languages that <\/span><i><span style=\"font-weight: 400;\">aren\u2019t <\/span><\/i><span style=\"font-weight: 400;\">English. For example, when translating from Russian to Spanish, Google Translate will first translate the text from Russian to <\/span><i><span style=\"font-weight: 400;\">English, <\/span><\/i><span style=\"font-weight: 400;\">and then from English into Spanish. As a result, when translating in languages other than English, machine translations actually involve two iterations.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In fact, some language pairs involve even more iterations. If you want to translate some text from, say, Catalan to Japanese, Google will translate it first into Spanish, as most existing Catalan translations are in Spanish. Then, this translated Spanish-language version of the original Catalan text will be translated into English. And finally, the English version of the Spanish version of the Catalan text will finally make it to Japanese &#8212; and if you\u2019re lucky, it will still bear some resemblance to the original meaning.<\/span><\/p>\n<p><b>Why it doesn\u2019t make the cut<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Google Translate does a good job with very basic translations &#8212; especially those whose target language is English &#8212; and now even offers alternative interpretations for certain words and phrases. However, the very methodology upon which Google Translate is based prevents it from ever competing with human translators. Here\u2019s why:<\/span><\/p>\n<p><b>Statistics don\u2019t have feelings. <\/b><span style=\"font-weight: 400;\">Google Translate is based on statistics &#8212; it chooses the \u201cbest\u201d translation based on how certain words and phrases have been translated in other documents. As a result, machine translators choose the <\/span><i><span style=\"font-weight: 400;\">most probable <\/span><\/i><span style=\"font-weight: 400;\">translation, but not the most interesting or poetic one. As a result, even if translations are accurate (which they often aren\u2019t), they adopt a robotic, lifeless tone. It takes a human translator, with feelings and creativity, to reproduce the tone, color, and vibrancy of the original text.<\/span><\/p>\n<p><b>Machine translations struggle with complex grammar. <\/b><span style=\"font-weight: 400;\">Language is based on rules, and as a result, a statistics-based translator like Google will struggle with complex grammatical concepts, such as the difference between the imperfect and preterite past tenses in Romance languages. This is especially true given that Google almost always uses English &#8212; a language that does not grammatically distinguish between preterite and imperfect tenses &#8212; as an intermediary step when translating into Romance languages. Therefore, Google Translate often incorrectly translates the imperfect past as the preterite past (and vice versa), making ongoing or habitual acts seem like one-time, completed events.<\/span><\/p>\n<p><b>Google can\u2019t write for an audience. <\/b><span style=\"font-weight: 400;\">Every translator knows that you need to tailor your work to whom you\u2019re writing for. For example, if this article were written for a casual blog, my use of the word \u201cwhom\u201d in the previous sentence may come off as overly formal. However, given that this article appears on a language interest blog, grammarians and language experts may applaud my correct distinction between \u201cwho\u201d and \u201cwhom\u201d (though they may scoff at my decision to end a sentence in a preposition). Machines cannot make such judgment calls &#8212; Google cannot take into account who the intended audience is for the article it translates. Only a human translator can make that kind of decision.<\/span><\/p>\n<p><b>Google Translate vs. a human being<\/b><\/p>\n<p><span style=\"font-weight: 400;\">To illustrate the difference between Google Translate and a living, breathing human translator, I will employ both to translate the following text in English, which appears on a website selling <\/span><a href=\"http:\/\/www.casabianchi.com.ar\/es\/vip\/enzoBianchi\"><span style=\"font-weight: 400;\">Argentine wine<\/span><\/a><span style=\"font-weight: 400;\">. Try to guess which one was written by a human, and which was produced by a machine (spoiler alert: it won\u2019t be hard).<\/span><\/p>\n<blockquote><p><span style=\"text-decoration: underline;\"><span style=\"font-weight: 400;\">Original text<\/span><span style=\"font-weight: 400;\">:<\/span><\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">Despu\u00e9s de una excelente cosecha como la que le precedi\u00f3, la cosecha 2009 muestra sus virtudes en este vino base Cabernet Sauvignon, mas el ensamble de tres variedades de gran personalidad que encontraron en San Rafael el terru\u00f1o ideal para la expresi\u00f3n de sus mejores cualidades. Vino aun de color rojo viol\u00e1ceo intenso a pesar de los a\u00f1os en botella, ya en la copa se nos muestra intenso y seductor con aromas especiados que se entremezclan con n\u00edtidos y frescos aromas a frutas de ciruelas, cerezas negras y moras, mientras que se van desprendiendo lentamente los aromas tostados que recuerdan a granos de caf\u00e9 molidos.<\/span><\/i><\/p>\n<p><span style=\"text-decoration: underline;\"><span style=\"font-weight: 400;\">Translation 1<\/span><span style=\"font-weight: 400;\">:<\/span><\/span><\/p>\n<p>After an excellent harvest like the one that preceded it, the 2009 vintage shows its virtues in this Cabernet Sauvignon base wine, plus the assembly of three varieties of great personality that found in San Rafael the ideal terroir for the expression of its best qualities. It was still intense violet red despite years in the bottle, already in the glass it is intense and seductive with spicy aromas that intermingle with clear and fresh aromas of plums, black cherries and blackberries, while leaving slowly releasing toasted aromas reminiscent of ground coffee beans.<\/p>\n<p><span style=\"text-decoration: underline;\"><span style=\"font-weight: 400;\">Translation 2<\/span><span style=\"font-weight: 400;\">:<\/span><\/span><\/p>\n<p><span style=\"font-weight: 400;\">After a great harvest like the one that preceded it, the 2009 harvest shows its virtues in this cuv\u00e9e Cabernet Sauvignon. It\u2019s an expressive mixture, articulated by three varieties of great personality that are found in San Rafael, the perfect region to bring out its best qualities. The wine still preserves a strong purplish-red color, in spite of the amount of years gone by since it was bottled. Once poured into the glass, it remains intense and seductive. Its spiced scents mix together with clear and fresh fruit aromas of plum, black cherry and blackberry, while its toasted scents release slowly, reminiscent of the delicious smell of ground coffee beans.<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400;\">You probably guessed right: the first translation was done by Google; the second, by a professionally trained bilingual translator. As you can see, the machine translation is comprehensible, but inelegant and sometimes confusing. On the other hand, the human translation flows smoothly, is organized coherently, and matches the elegant tone of the original article.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Machine translation isn\u2019t without its perks. It can be a life-saver in a pinch, when you need to get a rough idea of what a certain phrase means, or when you need to decipher a street sign while traveling to a foreign country. However, when it comes to translating important documents, the limitations of machine translation prevent it from being a viable option. As the above examples demonstrate, a translation that is based on statistical patterns will never match the quality of one created by a professional, who understands the rules and nuances of language. Indeed, machine translation has come a long way, but it\u2019s still far from replacing the human touch.<\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">The following post is from Paul, an English teacher who lives in Argentina. Paul writes on behalf of <\/span><\/i><a href=\"http:\/\/www.languagetrainers.com\"><i><span style=\"font-weight: 400;\">Language Trainers<\/span><\/i><\/a><i><span style=\"font-weight: 400;\">, a language teaching service which offers <\/span><\/i><a href=\"https:\/\/www.languagetrainers.com\/reviews\/foreign-film-reviews\/\"><i><span style=\"font-weight: 400;\">foreign language movie reviews<\/span><\/i><\/a><i><span style=\"font-weight: 400;\"> as well as other free language-learning resources on their website. Check out their <\/span><\/i><a href=\"http:\/\/www.facebook.com\/languagetrainers\"><i><span style=\"font-weight: 400;\">Facebook page<\/span><\/i><\/a><i><span style=\"font-weight: 400;\"> or send an email to paul(at)languagetrainers.com for more information.<\/span><\/i><\/p>\n","protected":false},"excerpt":{"rendered":"<img width=\"350\" height=\"193\" src=\"https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720-350x193.png\" class=\"attachment-post-thumbnail size-post-thumbnail wp-post-image\" alt=\"machine translation\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720-350x193.png 350w, https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720-768x422.png 768w, https:\/\/blogs.transparent.com\/language-news\/wp-content\/uploads\/sites\/28\/2015\/09\/translate-1820325_960_720.png 960w\" sizes=\"auto, (max-width: 350px) 100vw, 350px\" \/><p>With over 200 million daily translations, there\u2019s no denying that Google Translate is a wildly popular translation service. Indeed, machine translation has come very far since its infancy in the early 2000s. Instead of translating words at face value, machine translators have developed complex algorithms to deliver more accurate translations, and some even take into&hellip;<\/p>\n<p class=\"post-item__readmore\"><a class=\"btn btn--md\" href=\"https:\/\/blogs.transparent.com\/language-news\/2015\/09\/02\/how-google-translate-works-and-why-it-doesnt-measure-up\/\">Continue Reading<\/a><\/p>","protected":false},"author":1,"featured_media":7177,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":""},"categories":[542801],"tags":[],"class_list":["post-3566","post","type-post","status-publish","has-post-thumbnail","hentry","category-archived-posts"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/posts\/3566","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/comments?post=3566"}],"version-history":[{"count":11,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/posts\/3566\/revisions"}],"predecessor-version":[{"id":7178,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/posts\/3566\/revisions\/7178"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/media\/7177"}],"wp:attachment":[{"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/media?parent=3566"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/categories?post=3566"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.transparent.com\/language-news\/wp-json\/wp\/v2\/tags?post=3566"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}