第293号 −− Context Dependency in Case of English-Japanese Machine Translation (I-7) −− 2004.04.26 |
Context Dependency in Case of English-Japanese Machine Translation (I-7)
(English summary)
In this article we investigate the examples of dependence on context found in
Lesson 7 of the first grade five English textbooks.
1. Should "those" be translated as "arera no" or "sorera no" (Part 1)
In the 5th article [2] of this series, we investigated the criterion for
deciding the Japanese translation for "that" between "ano" and "sono". As its
plural form version, we need the criteria for deciding the Japanese translation
of "those" between "arera no" and "sorera no". In the example sentence in [2],
the noun following "that" was different from eithr the subject or object of the
preceding sentence. Consequently "ano" was chosen. In this article, we present
a similar example in section 1, and another example for which "arera no" is
rejected and "sorera no" is adopted by the converse reason in section 5 below.
Kenji: Who are those girls? ............. (1)
(Kenji: Arera no shoujo wa dare desu ka?)
Sam: They are my friends. We go to the same school. ........ (2)
(Sam: Kanojo-tachi wa watashi no yuujin desu. Watashitachi wa onaji gakkou
ni iki-masu.)
The choice rule applied to decide the Japanese for "those" in (1) above is;
88;それらの;2;E0=those;E1<>people;DJ1<>SJ$;DJ1<>OJ$;DJ1<>SJ2$;DJ1<>OJ2$;E1<>*BS;
E1<>*BS2;D1<>回; d340;
This rule verifies that the noun directly following "those" is different from
either the subject or object of the preceding sentence or of the sentence before
the last, and that the noun is contained neither in the preceding English
sentence (BS) nor in the sentence before the last (BS2). Consequently the noun
doesn't refer to something already mentioned but presents something new. By
these reasons the rule rejects "sorera no" and thus adopts "arera no".
2. Example of translating "they" into "kanojo-tachi" (the second case)
In the previous article [3] of this series, we showed the example where "they"
was translated into "kanojo-tachi", but the translated word was annihilated in
the process of connecting the translated words. This time we present an example
in wich "kanojo-tachi" is preserved in the final result of the translation.
For "they" in the example (2) of the previous section, the choice rule is;
22;*彼女;2;SZ=W;SZ=x;OZ<>x;EA<>,; D990;
This rule checks that the attribute set of the subject of the previous sentence
SZ contains 'W' (woman) and 'x' (plural symbol 's' at the tail of the word) and
that the attribute set of the object of the preceding sentence OZ doesn't contain
'x', it selects "kanojo-tachi" as the Japanese for "they". In fact, the subject
of the preceding sentence is "those girls".
3. Example of translating "Mrs." as "fujin"
For the title "Mrs.", three Japanese words "san", "fujin" and "sensei" are
registered. When "Mrs. AA" appears with "Mr. AA" in pair, "Mrs. AA" will be
translated as "AA Fujin".
Taro: How do you do, Mr Hill?
(Taro "Hajime mashite, Hiru san.")
Becky's Father: Nice to meet you, Taro.
(Bekii no Otou-san "Anata ni aete ureshii-desu, Taro.")
Taro: How do you do, Mrs Hill? ........ (3)
(Taro "Hajime mashite, Hiru Fujin.")
The choice rule for "Mrs." in (3) above is;
88;*夫人;2;E0=Mrs;E-1=How do you do ,;T1=K;E1=*BS2;BS2=*How do you do, Mr; d130;
This rule requires that the preceding English sentence (BS) is "How do you do,
(someone)?", that the attribute of the noun directly after Mrs. is 'K' (proper
noun), that the proper noun is contained in the sentence before the last (BS2)
and that the English sentence before the last contains "How do you do, Mr. (
someone)?" When all these conditions are satisfied, the rule selects 'fujin' as
the adequate Japanese for "Mrs."
Remark: "Hajime mashite, Hiru Fujin." is, in fact, not so natural as a greeting
by a junior highschool student. The most natural translation would simply be
"Hajime mashite", neglecting the English part "Mrs. Hill". But here we do not
require the translation to that extent.
4. Decision of what "too" refers to (a case of pronoun object)
In the 4th article [1] of this series, we investigated whether ", too" refers
to the object or to the subject of the sentence in case ", too" follows a noun
object of a transitive verb. When the object of the verb is a pronoun, we are
unable to apply similar criteria. Let's look at the following example. We remark
that among the five English textbooks, only New Crown and New Crown (New Edition)
write as "too" at the end of a sentence not as ", too" as in the other
textbooks. Both expressions seem to be used in USA and the Great Britain.
Tom: Yes, I can. They are ki, hayashi and mori.
(Tomu: Hai, deki-masu. sorera (sorera no tango) wa ki, hayashi to mori desu.)
Kumi: Right. They mean 'tree', 'woods' and 'forest'. Do you know this too?
.............. (4)
(Kumi: Sono-toori desu. Sorera wa 'ki', 'hayashi' to 'mori' wo imi-shi-masu.
Anata wa kore-mo shitte-i-masu ka?)
In the process of connecting the translated Japanese words from bottom up,
the composition grammar rule to connect "wo shitte-iru (ru 7)", "kore" and
"mo mata" into "kore-mo shitte-iru (ru 7)" is;
5;326;3;1;LT;J3=?;E2=too;E2<>* ;E-1=you;BS=*Yes, I can.;J0#-格;J0#VR;J1#+も;
J2#Z; 9662;
This rule reqires as preconditions that this part is at the end of a sentence
(LT = LasT) followed by the interrogation mark '?' (and thus a question sentence)
and that a preceding English word (E-1) which seems to be the subject is "you".
So we here need to choose between "Anata-mo ...wo ...shi-masu ka?" and "Anata wa
....-mo ...shi-masu ka?" And the rule conclude to choose "Anata wa ....-mo ...
shi-masu ka?", if the preceding English sentence contains "Yes, I can." This
implies that the subject of the preceding sentence is "I", and that the subject
of the present sentece "you" refers to the same person. By the principle that
the Japanese postpostion "mo" (corresponding to English "too") will not be
attached to the already mentioned (pro)noun, "mo" is attached to the object, not
to the subject.
5. Should "those" be translated as "arera no" or "sorera no" (Part 2)
In section 1, we showed an example of selecting "arera no" for "those" since
the noun following "those" is different either from the subject or from the
object. In this section, we are going to show a contrary example of rejecting
"arera no" and thus adopting "sorera no" by virtue of the preceding sentence
information.
Tom: English has many riddles like this.
(Tomu: Eigo ni-wa kono youna ooku-no nazo ga ari-masu.)
Kumi: Good. I like those word games. .......... (5)
(Kumi: Sore wa yoi desu ne. Watashi wa sorera no kotoba-asobi ga suki desu.)
The choice rule for "those" applied to (5) above is;
88;*それらの;2;E0=those;E-1=like;BS=* many ;BS=* like this.; d337;
The "word games" refered to "those" does not explicitly appear in the preceding
sentence ("riddles" is paraphrased), but the preceding English sentence (BS)
contains "... many ..." and "... like this", and so the noun phrase succeeding
"those" is considered to point out what is mentioned in the preceding sentence,
and by consequence the rule chooses "sorera-no" for "those".
6. The choice for "good" among "yoi", "yoroshii" and "zen"
In our system dictionary, as the translations for "good", three Japanese words,
"yoi" (adjective), "yoroshii" (interjection) and "zen" (abstract noun) are
registered. If one replies "Good," in a conversation, we tend to select the
interjection "yoroshii" for "good", but there are some cases where it is better
to select "yoi" regarding the phrase as the abbreviation of "It's good." Such is
the case of example (5) in the previous section. For "good" at the head of
example (5), the following two choice rules are successively applied and is
finally determined as "yoi";
5X;*好い;2;E0=good;FT;LT;DJ-1=「;E2=I;E3=like;BS=* has many ;BS=* like this.;
V860; Xよろしい ; 5好い
15;*好い;2;E0=good;FT;LT;DJ-1=「;E2=I;E3=like;BS=* has many ;BS=* like this.;
9380; 5好い ; 1善
These rules check that "good" in question is a single word sentence (FT;LT) at
the head of a speech (DJ-1=「;) followed by a sentence "I like ...". Namely the
situation is; "Good. I like ..." Further they verify that the preceding English
sentence (BS) contain "... has many ..." and "... like this." "Good. I like ..."
is a reply to this. Therefore the speech seems to mean "That's nice. I like ...,
too," and the rules select "yoi" for "good".
And during the process of generating the Japanese translation, "sore wa" and
"desu-ne" are attached to respectively from left and from right, and the
translation for "Good." becomes "Sore wa yoi desu-ne."
1;5;X;0;FT;LT;E0=good;DJ-1=「;E1=.;PJS=があります。」;J0#+ですね;J0#それは+;
1427;
By the way, this rule requires as one of the precondtion that the preceding
Japanese translation (PJS) contains "ga ari-masu.」" at the end of the preceding
speech. In replay to such an utterance, "Good. ..." at the head of the answer
must be translated into Japanese not simply by replacing the word registered in
the dictionary but by further adding "sore wa" and "desu-ne" before and after
the registered word to make the Japanese translation more natural.
7. Choice between "oboete-iru" and "omoi-dasu" for "remember"
For English verb "remember", we have two Japanese "oboete-iru" (keep in memory)
and "omoi-dasu" (recover the memory of). They are used as follows;
Kumi: What day is it today?
(Kumi: Kyou wa nan-youbi desu ka?)
Tom: It's Friday. This afternoon our class collects cans.
Do you remember? ..................... (6)
(Tomu: Kin-youbi desu. Kyou no gogo watashitachi no kurasu wa kan wo
atsume-masu. Anata wa oboete-i-masu ka?)
Kumi: Now I do. Thanks. ..................(7)
(Kumi: Ima omoi-dashi-mashita. Arigatou.)
The choice rule for "remember" in (6) above is;
33;*をおぼえている;2;E0=remember;J1=?;E-1=you;E-2=do;E-3=.; M230;
This rule verifies that the sentence under translation includes "Dou you
remember?" following dirctly after some statement (E-3=.;). If this condition
is satisfied, the question mentioned above is translated as "Anata wa oboete-i-
masu ka?". This rule doesn't investigate the context inheriting the preceding
sentences.
But, what is interesting is the answer (5) to the question. The auxiliary verb
"do" refers to the verb "remember" in the preceding sentence (4). In English
that's all. But in Japanese, "remember" in the question (4) and "do" (=
"remember") in the answer (5) have different meanings and they should be
expressed by different verbs. So we can't replace "do" by the value "oboete-iru"
of the variable VR of the verb of the preceding sentence. Therefore the
generation grammar rule applied in the process of translation of (5) checks that
what is going to be translated is the answer to the question "Do you remember?",
and then rewrites the Japanese for "do" as "omoi-dashi-mashita" rejecting the
verb of the preceding sentence "oboete-iru";
3;24;X;0;LT;E0=I;E1=do;E2=.;E-1=now;DJ-2=「;BS=Do you remember?;J0#Z;J1#Z;
J1#+思い出しました; 6217;
8. Is the "It" in "It is date/time." a formal subject or refers to something?
The "It" of "It's Friday." at the head of example (6) in the previous section
is usually viewed as a formal subject and it's Japanese translation "sore" will
be annihilated during the process of generating the translation. The grammar
rule connecting "sore"(it) + "dearu"(be) + "Kin-youbi"(Friday) => "Kin-youbi
desu" is
3;231;X;1;FT;E0=it;T1=v;T2=t;E2<>* on;E3<>that;PJS<>*の日が来ました。;J0#Z;
J1#DES; 6910;
This rule requires as a precondition that the preceding Japanese translation
(PJS) does not contain "... no hi ga ki-mashita." If the preceding translated
sentence contains such expression, it may happen that we translate "It is date/
time." as "Sore (= it) wa date/time desu."
9. Should "now" be translated as "ima" or "saa"?
English word "now" is used as meaning "present time" (adverb, noun) as well as
"Well, .." (interjection) for promption or starting a new subject. In case of
"now" in example (7) in the two sections back, the system applies the following
choice rule with very explicit conditions;
6X;*今;2;E0=now;FT;E1=I;E2=do;E3=.;DJ-1=「;BS=Do you remember?; Z937;
6今 ; Xさあ
This rule checks that the preceding English sentence (BS) is equal to "Do you
remember?" and that the present sentence is "Now I do. ...". When the conditions
are satisfied, this rule select "Ima" for the "Now" at the head. The conditions
above are so concrete since the criterion for such "now" strongly depends on the
lexical particularity of the verb "remember" in the preceding sentence which
would hardly be generalized.
<< References >>
[3] Katsuyuki SHIBATA: Context Dependency in case of English-Japanese Machine
Translation I-4.
http://www1.rsp.fukuoka-u.ac.jp/chosho/Cntxt1-4.html
[3] Katsuyuki SHIBATA: Context Dependency in case of English-Japanese Machine
Translation I-5.
http://www1.rsp.fukuoka-u.ac.jp/chosho/Cntxt1-5.html
[3] Katsuyuki SHIBATA: Context Dependency in case of English-Japanese Machine
Translation I-6.
http://www1.rsp.fukuoka-u.ac.jp/kototoi/2004_4E.html#No290
|