Quality Estimation for Machine Translation

Author: Lucia Specia
Publisher: Springer Nature
Total Pages: 148
Release: 2022-05-31
Genre: Computers
ISBN: 3031021681

Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics face the challenge that text-to-text approaches can produce multiple good (and bad) quality outputs for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), where systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate of how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as the application, since this represents most of the QE work done to date. It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.

From Syntax to Semantics

Author: Erich Steiner
Publisher: Intellect Books
Total Pages: 280
Release: 1988
Genre: Computers

Machine translation is a central aspect of research in artificial intelligence. This book is written in the context of the Machine Translation (MT) project EUROTRA, a multilingual MT project that puts special emphasis on the definition of semantic representation.

Machine Translation of User-generated Content

Author: Pintu Lohar
Release: 2020

The world of social media has evolved enormously during the last few years. With the spread of social media and online forums, individual users actively participate in the generation of online content in different languages from all over the world. Sharing online content has become much easier with the advent of popular websites such as Twitter and Facebook. Such content is referred to as 'User-Generated Content' (UGC). Examples of UGC include user reviews, customer feedback, and tweets. In general, UGC is informal and noisy in terms of linguistic norms. Such noise does not create significant problems for humans trying to understand the content, but it can pose challenges for several natural language processing applications such as parsing, sentiment analysis, and machine translation (MT). An additional challenge for MT is the sparseness of bilingual (translated) parallel UGC corpora. In this research, we explore the general issues in MT of UGC and set some research goals from our findings. One of our main goals is to exploit comparable corpora in order to extract parallel or semantically similar sentences. To accomplish this task, we design a document alignment system to extract semantically similar bilingual document pairs from the bilingual comparable corpora. We then apply strategies to extract parallel or semantically similar sentences from comparable corpora by transforming the document alignment system into a sentence alignment system. We seek to improve the quality of parallel data extraction for UGC translation and combine the extracted data with the existing human-translated resources. Another objective of this research is to demonstrate the usefulness of MT-based sentiment analysis. However, when using openly available systems such as Google Translate, the translation process may alter the sentiment in the target language. To cope with this phenomenon, we instead build fine-grained sentiment translation models that focus on sentiment preservation in the target language during translation.

Metataxis

Author: Klaus Schubert
Publisher: Walter de Gruyter GmbH & Co KG
Total Pages: 252
Release: 2019-10-08
Genre: Language Arts & Disciplines
ISBN: 311087606X

Linguistic Issues in Machine Translation

Author: Frank Van Eynde
Publisher: Bloomsbury Publishing
Total Pages: 258
Release: 2015-12-17
Genre: Language Arts & Disciplines
ISBN: 1474246559

The key assumption in this text is that machine translation is not merely a mechanical process but in fact requires a high level of linguistic sophistication, as the nuances of syntax, semantics and intonation cannot always be conveyed by modern technology. The increasing dependence on artificial communication by private and corporate users makes this research area an invaluable element when teaching linguistic theory.

Advances in Empirical Translation Studies

Author: Meng Ji
Publisher: Cambridge University Press
Total Pages: 285
Release: 2019-06-13
Genre: Computers
ISBN: 1108423272

Introduces the integration of theoretical and applied translation studies for socially-oriented and data-driven empirical translation research.

Syntax-based Statistical Machine Translation

Author: Philip Williams
Publisher: Springer Nature
Total Pages: 190
Release: 2022-05-31
Genre: Computers
ISBN: 3031021649

This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous context-free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.

Machine Translation

Author: John Lehrberger
Publisher: John Benjamins Publishing
Total Pages: 257
Release: 1988-01-01
Genre: Language Arts & Disciplines
ISBN: 9027286205

The use of the computer in translating natural languages ranges from that of a translator's aid for word processing and dictionary lookup to that of a full-fledged translator on its own. However, the obstacles to translating by means of the computer are primarily linguistic. To overcome them, it is necessary to resolve the ambiguities that pervade a natural language when words and sentences are viewed in isolation. The problem, then, is to formalize, in the computer, these aspects of natural language understanding. The authors show how, from a linguistic point of view, one may form some idea of what goes on inside a system's black box, given only the input (original text) and the raw output (translated text before post-editing). Many examples of English/French translation are used to illustrate the principles involved.

Machine Translation Summit

Author: Makoto Nagao
Publisher: IOS Press
Total Pages: 248
Release: 1989
Genre: Computers
ISBN: 9784274074455
