An introduction to machine translation for localisation

Machine learning has made its way into nearly every industry, and game localization is no exception. Software providers claim that their machine translation products mark a new era in localization, but gamers are often left wishing that game publishers would pay more attention to detail.

As a professional localization company that is currently working with machine translation post-editing, Alconost could not pass up the topic. In this article we aim to find out what’s hot (and what’s not) about machine translation (MT) and how to get the most out of it without sacrificing quality.

NMT in a nutshell: what on Earth is it?

When machine learning was introduced to localization, it was seen as a great asset, and for quite a while localization companies worked using the PEMT approach. PEMT stands for post-edited machine translation: it means that after a machine translates your text, translators go through it and edit it. The main problem with PEMT is that the machine translates without checking the text against previous or current translations or a glossary; it just translates the text as it “sees” it. So naturally this method results in numerous mistakes, creating a need for manual editing.

As time passed and technology advanced, NMT (neural machine translation) came into play. This proved a much more reliable and robust solution. NMT uses neural networks and deep learning to not just translate the text but actually learn the terminology and its specifics. This makes NMT much more accurate than PEMT and, with sufficient learning, delivers high-quality results much faster than any manual translation.

NMT solutions: a brief overview

It’s no surprise that there are dozens of ready-made NMT solutions on the market. These can be divided into two main categories: stock and custom NMT engines. We will talk about custom (or niche-specific) NMT tools a bit later; for now, let’s focus on stock NMT.

Stock NMT engines are based on general translation data. While these datasets are vast and rich (for example, Google’s database), they are not domain-oriented. This means that when using a stock NMT tool you get a general understanding of the text’s meaning, but you don’t get an accurate translation of specific phrases and words.

Examples of stock NMT engines include Google Cloud Translation, Amazon Translate, DeepL Translator, CrossLang, Microsoft Translator, Intento, and KantanMT.

The chief advantage of these solutions is that most of them are public and free to use (like Google Translate). Commercial stock NMTs offer paid subscriptions with their APIs and integration options. But their biggest drawback is that they don’t consider the complexity of game localization. More on that below.

The many intricacies of game localization

While machine translation works fine in many industries, game localization turned out to be a tough nut to crack. The main reason for this is that gaming (regardless of the type of game) always aims for an immersive experience, and one core part of that experience is natural-sounding dialogue and in-game text. So what’s so challenging about translating them properly?

  • Creativity

It may sound like a given, but creativity plays a massive role in bringing games to life, especially when it comes to their translation. A translator might have a sudden flash of inspiration and come up with an unexpected phrasing or wording that resonates with players much better than the original text.

Can a machine be creative? Not yet. And that means that machine translations may always lack the creative element that sometimes makes the whole game shine.

  • Specific phrasing, dialects, and slang

One of the biggest challenges in localization is making the translation sound as natural as possible. And since every country and region has its own specific languages and dialects, it takes a thorough understanding of one’s culture to successfully adapt a translation to it.

While a machine learning solution can be trained on an existing database, what if it comes across a highly specific phrase that only locals know how to use? This is where professional translation by native-speaking linguists and community feedback are highly helpful. Native speakers of the target language who know its intricacies can advise on the best wording. And for that, you need to have a feel for the language that you’re working with, not just theoretical knowledge.

  • Tone and overall vibe

Certain words convey a certain tone, and choosing them is something that we do without thinking, just by feel. So when translating a game, a human translator can sense the overall vibe of the game (or of a specific dialogue) and use not just the original wording but synonyms that better convey the tone and mood. Conversely, a machine is not able to “sense the mood,” so in some cases the translation may not sound as natural as it could.

Advantages of using MT for game localization

Despite all the challenges around game localization, machine translation still does a pretty decent job. This technology has several significant benefits that make MT a great choice when it comes to certain tasks.

  • Speed

Speed is probably the biggest benefit of machine translation and its unique selling point. A machine can translate massive chunks of text in mere minutes, compared to the days or even weeks it would take a translator. In many cases it proves faster and more efficient to create a machine translation first and then edit it. Besides, the speed of MT is very handy if you need to quickly release an update and can manage with “good enough” translation quality.

  • Translation of out-of-game content

When talking about game localization, the first thing that comes to mind is usually in-game dialogue. But game localization is much more than that: it includes user manuals, how-tos, articles, guides, and marketing texts. This kind of copy doesn’t employ much creativity and imagery, since these materials don’t really impact how immersive the gaming experience will be. If a user spots a mistake while reading your blog, it’s less likely to ruin the game experience for them.

  • Cost of services

One more huge advantage of machine translation is its relatively low cost. Compared to the rates of professional translators, machine translation tends to be more affordable. Hence, it can save you money while letting you allocate experts to more critical tasks.

  • Consistency of translation

One more way MT can benefit your project is translation consistency. When several independent translators work on a text, they may translate certain words differently, so that the same term ends up with several competing translations. But with machine translation repetitive phrases are always translated the same way, improving the consistency of your text.
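To make the consistency problem concrete, here is a minimal Python sketch of a check that flags source strings translated in more than one way. It is an illustration only, not a description of any particular tool: the function names and the sample English-to-Spanish strings are hypothetical, and a real workflow would usually rely on the translation memory of a CAT tool instead.

```python
from collections import defaultdict


def normalize(text: str) -> str:
    """Collapse whitespace and ignore case so trivially different strings match."""
    return " ".join(text.split()).casefold()


def find_inconsistent_translations(pairs):
    """Return source strings that were translated in more than one way.

    `pairs` is an iterable of (source, translation) tuples.
    """
    translations_by_source = defaultdict(set)
    for source, translation in pairs:
        translations_by_source[normalize(source)].add(translation.strip())
    return {
        source: sorted(variants)
        for source, variants in translations_by_source.items()
        if len(variants) > 1
    }


# Hypothetical UI strings delivered by two different translators.
sample_pairs = [
    ("Start game", "Iniciar juego"),
    ("start  game", "Comenzar partida"),
    ("Settings", "Ajustes"),
    ("Settings", "Ajustes"),
]

inconsistencies = find_inconsistent_translations(sample_pairs)
print(inconsistencies)
```

In this sample, “Start game” is reported because it has two competing translations, while “Settings” passes because both occurrences match.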

Is machine translation good enough for games?

MT is not 100% accurate, according to gamers. For example, a recent Reddit discussion features hundreds of comments left by frustrated gamers, the majority of whom say the same thing: companies are going for fast profits instead of investing in high-quality translation. And what’s the tool to deliver quick results that are “good enough”? You guessed it — machine translation.


Unfortunately, when gaming companies try to release games faster it leads not only to a poor user experience but also to a significant drop in brand loyalty. Many gamers cite poor translations as one of the biggest drawbacks of gaming companies.

So what options are there when Google NMT isn’t enough? Here’s an idea for what might work best.

  • Localization-specific NMT

While neural machine translation has certain flaws, it has many benefits as well. It’s quick, it’s moderately accurate, and it can actually be quite helpful if you need to quickly translate massive amounts of documents (such as user manuals). So what we see as the perfect solution is niche-oriented, localization-specific NMT (or custom NMT).

For instance, Alconost is currently working on a product that uses neural machine learning and a vast database of translations in different languages. This lets us achieve higher accuracy and adapt the machine not just for general translation but for game translation, and there is a big difference between the two. In addition, we use cloud platforms (such as Crowdin and GitLocalize) with open-source data. That means that glossaries and translation memories from one project can be used for another. And obviously our translators post-edit the text to ensure that the translation was done right.

Custom domain-adapted NMT solutions may become a milestone in localization, as they are designed with a specific domain in mind. Their biggest advantages are high translation accuracy, speed, affordability (as they’re cheaper than hiring professional translators), and the option to explore new niches and domains.

Some content, such as user reviews, sometimes goes untranslated because it is too specific and there is not much of it. It wouldn’t make much sense to use a stock NMT solution to translate it, as the output would require heavy post-editing.

Custom NMT tools, however, can be designed to work with user reviews and “understand” the tone of voice, so that even this specialized content can be translated by a machine. This solution has been implemented by Airbnb, where reviews and other user-generated content are translated in a flash just by pressing the “Translate” button.

In addition, machine translators can be trained to recognize emotions and mood and, when paired with machine-learning classifiers, to label and prioritize feedback. This can also be used to collect data on users’ online behavior, which is a highly valuable asset to any company.

How machine translation differs from traditional localization

Finally, let’s talk about the intricacies of localizing a text translated by a machine, and how the process differs from standard localization. We’ll compare the two approaches based on our own experience acquired while working on different projects.

  • Process

When we localize a project from scratch, it’s safe to say we are in full control of the quality, since the team has glossaries and context available from the start. Here the text is translated with a specific domain in mind, and only rarely do we have to post-edit the translated copy.

With machine translation, however, things are a bit different. The source text can be translated by different engines, all of which differ in terms of quality and accuracy. So when we start working with these texts, we request all available materials (style guides, glossary, etc.) from the client to ensure that the translation fits the domain and the brand’s style. This means that post-editing machine translations requires the additional step of assessing the quality and accuracy for the given project.

  • Quality

When you choose a traditional localization approach, there is a 99% chance that your project will be assigned to a person who has the most experience with your particular language and domain.

But with machine translation you can’t really be sure how well the machine has been trained and how much data it has for different languages. One engine may have learned 10,000 pages of Spanish-English translations, while another engine has studied 1,000,000 pages. Obviously, the latter is going to be more accurate.

The bottom line is that when working with a machine translation engine “trained” by a professional localization company on niche topics, there’s an excellent chance that they’ll ensure the “proficiency” of the customized MT engine and, consequently, the quality of the translation. With an ample translation database and professional editors by your side, you can put your mind at ease, knowing that your project is in good hands.
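One part of the quality assessment described above, checking machine output against the client’s glossary, can be sketched in a few lines of Python. This is a rough illustration under stated assumptions, not Alconost’s actual process: the function name, the glossary entries, and the sample sentences are all hypothetical, and the plain substring match ignores inflected forms that a real check would have to handle.

```python
import re


def missing_glossary_terms(source, translation, glossary):
    """List glossary terms found in the source whose approved translation
    does not appear in the machine output.

    `glossary` maps a source-language term to its approved target term.
    Returns a list of (source_term, expected_target_term) tuples.
    """
    missing = []
    translation_folded = translation.casefold()
    for source_term, target_term in glossary.items():
        # Match the source term as a whole word or phrase, ignoring case.
        pattern = r"\b" + re.escape(source_term) + r"\b"
        in_source = re.search(pattern, source, flags=re.IGNORECASE) is not None
        # Simplification: a substring test that will miss inflected forms.
        in_translation = target_term.casefold() in translation_folded
        if in_source and not in_translation:
            missing.append((source_term, target_term))
    return missing


# Hypothetical client glossary and one machine-translated segment.
glossary = {"health potion": "poción de vida", "quest": "misión"}
source_text = "Drink a health potion before the quest."
machine_output = "Bebe una poción de salud antes de la misión."

issues = missing_glossary_terms(source_text, machine_output, glossary)
print(issues)
```

Here the segment is flagged because the engine rendered “health potion” as “poción de salud” rather than the approved glossary term, which is the kind of issue a post-editor would then correct.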

Kris Trusava is localization growth manager at Alconost, a provider of localization services for games and other software into over 80 languages.
