A Model for Fine-Grained Alignment of Multilingual Texts

While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level in-between are a useful resource for many purposes. Starting from a number of examples of non-literal translations,...

Authors: Cyrus, Lea
Feddes, Hendrik
Division/Institute:FB 09: Philologie
Document types:Article
Media types:Text
Publication date:2004
Date of publication on miami:21.06.2006
Modification date:06.04.2022
Edition statement:[Electronic ed.]
Source:Proc. COLING 2004 Workshop on Multilingual Linguistic Resources (MLR2004). Geneva, August 28 (2004), S. 15-22
Subjects:Korpuslinguistik; Computerlinguistik; syntaktische Annotation; semantische Annotation
DDC Subject:400: Sprache
License:InC 1.0
Language:Englisch
Format:PDF document
URN:urn:nbn:de:hbz:6-92619507330
Permalink:https://nbn-resolving.de/urn:nbn:de:hbz:6-92619507330
Digital documents:0408_coling.pdf
LEADER 02117cam a2200325uu 4500
001 c5da43cd-a4f6-4a47-a26a-57e350dca329
003 miami
005 20220406
007 c||||||||||||a|
008 060621e20060621||||||||||#s||||||||eng||||||
024 7 |a urn:nbn:de:hbz:6-92619507330  |2 urn 
041 |a eng 
082 0 |a 400 Sprache  |2 23 
100 1 |a Cyrus, Lea  |4 aut 
110 2 |a Universitäts- und Landesbibliothek Münster  |0 http://d-nb.info/gnd/5091030-9  |4 own 
245 1 0 |a A Model for Fine-Grained Alignment of Multilingual Texts 
250 |a [Electronic ed.] 
264 1 |c 2004 
264 2 |b Universitäts- und Landesbibliothek Münster  |c 2006-06-21 
300 |a 15-22 
506 0 |a free access 
510 0 |a Proc. COLING 2004 Workshop on Multilingual Linguistic Resources (MLR2004). Geneva, August 28 (2004), S. 15-22 
520 3 |a While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level in-between are a useful resource for many purposes. Starting from a number of examples of non-literal translations, which tend to make alignment difficult, we describe an alignment model which copes with these cases by explicitly coding them. The model is based on predicate-argument structures and thus covers the middle ground between sentence and word alignment. The model is currently used in a recently initiated project of a parallel English-German treebank (FuSe), which can in principle be extended with additional languages. 
540 |a InC 1.0  |u https://rightsstatements.org/vocab/InC/1.0/ 
653 0 |a Korpuslinguistik  |a Computerlinguistik  |a syntaktische Annotation  |a semantische Annotation 
655 7 |2 DRIVER Types  |a Artikel 
655 7 |2 DCMI Types  |a Text 
700 1 |a Feddes, Hendrik  |u FB 09: Philologie  |4 aut 
856 4 0 |3 Zum Volltext  |q text/html  |u https://nbn-resolving.de/urn:nbn:de:hbz:6-92619507330  |u urn:nbn:de:hbz:6-92619507330 
856 4 0 |3 Zum Volltext  |q application/pdf  |u https://repositorium.uni-muenster.de/document/miami/c5da43cd-a4f6-4a47-a26a-57e350dca329/0408_coling.pdf