Building a resource for studying translation shifts

This paper describes an interdisciplinary approach which brings together the fields of corpus linguistics and translation studies. It presents ongoing work on the creation of a corpus resource in which translation shifts are explicitly annotated. Translation shifts denote departures from formal corre...

Author: Cyrus, Lea
Division/Institute:FB 09: Philologie
Document types:Article
Media types:Text
Publication date:2006
Date of publication on miami:21.06.2006
Modification date:30.01.2015
Edition statement:[Electronic ed.]
Source:Proc. LREC 2006. Genoa, May 24-26 (2006), S. 1240-1245
Subjects:Korpuslinguistik; Computerlinguistik; syntaktische Annotation; semantische Annotation
DDC Subject:400: Sprache
License:InC 1.0
Language:English
Format:PDF document
URN:urn:nbn:de:hbz:6-92619494486
Permalink:http://nbn-resolving.de/urn:nbn:de:hbz:6-92619494486
Digital documents:0605_lrec.pdf

This paper describes an interdisciplinary approach which brings together the fields of corpus linguistics and translation studies. It presents ongoing work on the creation of a corpus resource in which translation shifts are explicitly annotated. Translation shifts denote departures from formal correspondence between source and target text, i. e. deviations that have occurred during the translation process. A resource in which such shifts are annotated in a systematic way will make it possible to study those phenomena that need to be addressed if machine translation output is to resemble human translation. The resource described in this paper contains English source texts (parliamentary proceedings) and their German translations. The shift annotation is based on predicate-argument structures and proceeds in two steps: first, predicates and their arguments are annotated monolingually in a straightforward manner. Then, the corresponding English and German predicates and arguments are aligned with each other. Whenever a shift – mainly grammatical or semantic – has occurred, the alignment is tagged accordingly.