Journal of Artificial Intelligence Research
18 (2003),
pp. 117-147. Submitted 7/02; published 2/03.
© 2001 AI Access Foundation
and Morgan Kaufmann Publishers.
All rights reserved.
Translation of Pronominal Anaphora between English and Spanish: Discrepancies
and Evaluation
Jesús Peral
Dept. Lenguajes y Sistemas Informáticos
University of Alicante
Alicante, SPAIN
Antonio Ferrández
Dept. Lenguajes y Sistemas Informáticos
University of Alicante
Alicante, SPAIN
jperal@dlsi.ua.es
antonio@dlsi.ua.es
Abstract:
This paper evaluates the different tasks carried out in the translation
of pronominal anaphora in a machine translation (MT) system. The MT interlingua
approach named AGIR (Anaphora Generation with an Interlingua Representation)
improves upon other proposals presented to date because it is able to translate
intersentential anaphors, detect co-reference chains, and translate Spanish
zero pronouns into English, issues hardly considered by other systems.
The paper presents the resolution and evaluation of these anaphora problems
in AGIR with the use of different kinds of knowledge (lexical, morphological,
syntactic, and semantic). The translation of English and Spanish anaphoric
third-person personal pronouns (including Spanish zero pronouns) into the
target language has been evaluated on unrestricted corpora. We have obtained
a precision of 80.4% and 84.8% in the translation of Spanish and English
pronouns, respectively. Although we have only studied the Spanish and English
languages, our approach can be easily extended to other languages such
as Portuguese, Italian, or Japanese.
Jesus Peral 2002-12-13