Limit search to available items
Record 16 of 60
Previous Record Next Record
Book Cover
E-book
Author Tiedemann, Jörg.

Title Bitext alignment / Jörg Tiedemann
Published Cham, Switzerland : Springer, ©2011
Online access available from:
Synthesis Digital Library    View Resource Record  

Copies

Description 1 online resource (viii, 153 pages) : illustrations
Series Synthesis lectures on human language technologies, 1947-4059 ; #14
Synthesis lectures on human language technologies ; lecture #14. 1947-4059
Contents Preface -- Acknowledgments -- 1. Introduction -- Applications -- Further readings -- 2. Basic concepts and terminology -- Bitext and alignment -- Alignment and segmentation -- Alignment spaces and constraints -- Correlations and cues -- Alignment models and search algorithms -- Evaluation of bitext alignment -- Summary and further reading -- 3. Building parallel corpora -- Document alignment -- Mining the web -- Extracting parallel data from comparable corpora -- Summary and further reading -- 4. Sentence alignment -- Length-based approaches -- Lexical matching approaches -- Combined and resource-specific techniques -- Summary and further reading -- 5. Word alignment -- Generative alignment models -- Constraints and heuristics -- Discriminative alignment models -- Translation spotting and bilingual lexicon induction -- Summary and further reading -- 6. Phrase and tree alignment -- Parallel treebanks and tree alignment -- Hierarchical alignment and transduction grammars -- Summary and further reading -- 7. Concluding remarks -- Final recommendations -- A. Resources & tools -- Bibliography -- Author's biography
Summary This book provides an overview of various techniques for the alignment of bitexts. It describes general concepts and strategies that can be applied to map corresponding parts in parallel documents on various levels of granularity. Bitexts are valuable linguistic resources for many different research fields and practical applications. The most predominant application is machine translation, in particular, statistical machine translation. However, there are various other threads that can be followed which may be supported by the rich linguistic knowledge implicitly stored in parallel resources. Bitexts have been explored in lexicography, word sense disambiguation, terminology extraction, computer-aided language learning and translation studies to name just a few. The book covers the essential tasks that have to be carried out when building parallel corpora starting from the collection of translated documents up to sub-sentential alignments. In particular, it describes various approaches to document alignment, sentence alignment, word alignment and tree structure alignment. It also includes a list of resources and a comprehensive review of the literature on alignment techniques
Analysis Alignment
Bitexts
Parallel corpora
Sentence alignment
Word alignment
Tree alignment
Statistical machine translation
Transduction grammars
Text mining
Lexicon induction
Bibliography Includes bibliographical references (pages 129-152)
Subject Machine translating
Computational linguistics.
computational linguistics.
LANGUAGE ARTS & DISCIPLINES -- Translating & Interpreting.
Computational linguistics
Machine translating
Form Electronic book
ISBN 9781608455119
1608455114
9781608455102
1608455106
9783031021428
3031021428