Normal view MARC view ISBD view

Syntax-based Statistical Machine Translation [electronic resource] / by Philip Williams, Rico Sennrich, Matt Post, Philipp Koehn.

By: Williams, Philip [author.].
Contributor(s): Sennrich, Rico [author.] | Post, Matt [author.] | Koehn, Philipp [author.] | SpringerLink (Online service).
Material type: materialTypeLabelBookSeries: Synthesis Lectures on Human Language Technologies: Publisher: Cham : Springer International Publishing : Imprint: Springer, 2016Edition: 1st ed. 2016.Description: XVIII, 190 p. online resource.Content type: text Media type: computer Carrier type: online resourceISBN: 9783031021640.Subject(s): Artificial intelligence | Natural language processing (Computer science) | Computational linguistics | Artificial Intelligence | Natural Language Processing (NLP) | Computational LinguisticsAdditional physical formats: Printed edition:: No title; Printed edition:: No titleDDC classification: 006.3 Online resources: Click here to access online
Contents:
Preface -- Acknowledgments -- Models -- Learning from Parallel Text -- Decoding I: Preliminaries -- Decoding II: Tree Decoding -- Decoding III: String Decoding -- Selected Topics -- Closing Remarks -- Bibliography -- Authors' Biographies -- Author Index -- Index.
In: Springer Nature eBookSummary: This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, includingsearch approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.
    average rating: 0.0 (0 votes)
No physical items for this record

Preface -- Acknowledgments -- Models -- Learning from Parallel Text -- Decoding I: Preliminaries -- Decoding II: Tree Decoding -- Decoding III: String Decoding -- Selected Topics -- Closing Remarks -- Bibliography -- Authors' Biographies -- Author Index -- Index.

This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, includingsearch approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.

There are no comments for this item.

Log in to your account to post a comment.