A Model for Fine-Grained Alignment of Multilingual Texts
While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level in-between are a useful resource for many purposes. Starting from a number of examples of non-literal translations,...
Authors: | |
---|---|
Division/Institute: | FB 09: Philologie |
Document types: | Article |
Media types: | Text |
Publication date: | 2004 |
Date of publication on miami: | 21.06.2006 |
Modification date: | 06.04.2022 |
Edition statement: | [Electronic ed.] |
Source: | Proc. COLING 2004 Workshop on Multilingual Linguistic Resources (MLR2004). Geneva, August 28 (2004), S. 15-22 |
Subjects: | Korpuslinguistik; Computerlinguistik; syntaktische Annotation; semantische Annotation |
DDC Subject: | 400: Sprache |
License: | InC 1.0 |
Language: | English |
Format: | PDF document |
URN: | urn:nbn:de:hbz:6-92619507330 |
Permalink: | https://nbn-resolving.de/urn:nbn:de:hbz:6-92619507330 |
Digital documents: | 0408_coling.pdf |
While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level in-between are a useful resource for many purposes. Starting from a number of examples of non-literal translations, which tend to make alignment difficult, we describe an alignment model which copes with these cases by explicitly coding them. The model is based on predicate-argument structures and thus covers the middle ground between sentence and word alignment. The model is currently used in a recently initiated project of a parallel English-German treebank (FuSe), which can in principle be extended with additional languages.