Parallel text
From Free net encyclopedia
Revision as of 13:17, 12 October 2005
A parallel text is a text in one language together with its translation in another language. Parallel text alignment is the identification of the corresponding sentences in both halves of the parallel text.
Large collections of parallel texts are called parallel corpora (see text corpus). Sentence-level alignment of parallel corpora is a prerequisite for many areas of linguistic research. During translation, sentences can be split, merged, deleted, inserted, or reordered, which makes alignment a non-trivial task.
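To illustrate why splits, merges, and deletions complicate alignment, here is a minimal sketch of a length-based aligner using dynamic programming. It is loosely in the spirit of length-based methods such as Gale and Church's, but it is a simplification rather than that algorithm: the function names (`align`, `length_cost`), the set of allowed groupings, and the penalty values are all illustrative assumptions. It also cannot detect reordered sentences.

```python
from typing import List, Tuple

# Allowed groupings: (source sentences consumed, target sentences consumed),
# mapped to a fixed penalty. These penalty values are assumptions chosen for
# illustration only; they are not tuned or taken from any published method.
BEADS = {(1, 1): 0.0, (1, 0): 3.0, (0, 1): 3.0, (2, 1): 1.0, (1, 2): 1.0}


def length_cost(src: List[str], tgt: List[str]) -> float:
    """Relative difference in character length between two sentence groups."""
    s = sum(len(x) for x in src)
    t = sum(len(x) for x in tgt)
    return abs(s - t) / max(s, t, 1)


def align(src: List[str], tgt: List[str]) -> List[Tuple[List[str], List[str]]]:
    """Return the lowest-cost monotone grouping of source and target sentences."""
    n, m = len(src), len(tgt)
    inf = float("inf")
    # cost[i][j]: best cost of aligning the first i source and j target sentences.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for (di, dj), penalty in BEADS.items():
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                c = cost[i][j] + penalty + length_cost(src[i:ni], tgt[j:nj])
                if c < cost[ni][nj]:
                    cost[ni][nj] = c
                    back[ni][nj] = (i, j)

    # Follow the back-pointers from the end to recover the chosen groupings.
    pairs = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        pairs.append((src[pi:i], tgt[pj:j]))
        i, j = pi, pj
    pairs.reverse()
    return pairs
```

Called as `align(source_sentences, target_sentences)`, it returns a list of (source group, target group) pairs, where an empty group marks an insertion or deletion. Character length is only a crude signal; practical aligners typically combine it with lexical evidence.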
External links
- Parallel text processing bibliography by J. Veronis and M.-D. Mahimon
- The Opus project, which aims to collect freely available parallel corpora