supporting library has never been done.
2.1 Machine Translation
There are three different kind of machine translation.
The rule-based method is a technique that
uses standard language rules in the process of
transliteration (Rahman et al., 2014; Dewantara et al.,
2013). Hansel (2009) states that statistical machine
translation utilizes a machine translation paradigm
in which the translation results are generated on the
basis of statistical models using parameters obtained
from the analysis of the collections of parallel
two-language texts. The neural machine translation
is a new feature of google translate that works by
translating all sentences at once, so the translation
looks more natural, accurate and not weird when it is
read.
In the research of Nasution, et.al (2017), Machine
Translation (MT) is very useful in supporting
multicultural communication. Existing Statistical
Machine Translation (SMT) which requires high
quality and quantity of corpora and Rule-Based
Machine Translation (RBMT) which requires
bilingual dictionaries, morphological, syntax, and
semantic analyzer are scarce for low-resource
languages. Due to the lack of language resources,
it is difficult to create MT from high-resource
languages to low-resource languages like Indonesian
ethnic languages. Nevertheless, Indonesian ethnic
languages’ characteristics motivate us to introduce
a Pivot-Based Hybrid Machine Translation (PHMT)
by combining SMT and RBMT with Indonesian as
a pivot which we further utilize in a multilingual
communication support system(Nasution et al., 2017;
Panggabean, 2016).
2.2 Pivot-based Hybrid Machine
Translation
In the research of Nasution, et.al (2018), Google
Translate service and bilingual dictionary service
were combined as a composite service in the language
grid. There are more than a hundred high-resource
languages available in the Google Translate service.
To this date, two Indonesian ethnic languages, i.e.,
Javanese and Sundanese, are available in Google
Translate service alongside the official language,
Indonesian (Nasution et al., 2018; Nugroho, 2005).
It is unlikely that Google Translate can provide
the rest of Indonesian ethnic languages in the near
future, since the available corpora for Indonesian
ethnic languages are still scarce. In order to
bridge the gap between high-resource languages and
low-resource languages, in this case between English
and Minangkabau, a quicker approach is to create
an English-Minangkabau PHMT with Indonesian
as the pivot. Since Minangkabau has 61.59%
lexical similarity with Indonesian based on ASJP,
the morphology and syntax are similar. Therefore,
Indonesian-Minangkabau word-to-word translation is
expected to be acceptable.
2.3 Language Grid
Toru Ishida (2018) mentioned that globalization
increasingly demands multilingual communication
on the Internet, as well as in local communities.
To create customized collaboration tools to support
multilingual communities, the Language Grid was
established ten years ago. It has been improving
web-based services to communities throughout the
world by providing highly adaptable infrastructure
and access to a wide variety of language resources
and services (Ishida et al., 2018; Nasution et al., 2017;
Nasution, 2018).
3 RESEARCH METHOD
3.1 System Overview
Based on the results of the research analysis, it
can be concluded that the Augmented reality-based
Translating Machine has two criteria. This
Augmented reality-based Translating Machine can
interact with users by translating text from Indonesian
into English and vice versa, and by displaying
sound as the result of translation and animated 3D
characters. Augmented reality-based Translating
Machine is markerless, which means that it does not
use printed markers to display 3D animation models.
Figure 1 explains the bird view of process from
input in the form of text to output in the form of
animation object and speech translation results.
Figure 1: Whole System Overview.
ICoSET 2019 - The Second International Conference on Science, Engineering and Technology
164