Corpus Linguistics and Linguistically Annotated Corpora

Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
Total Pages: 321
Release: 2014-12-18
Genre: Language Arts & Disciplines
ISBN: 1441119914

Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but annotation also makes queries and query tools more complex at the outset. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, part-of-speech, syntactic, semantic and discourse-level annotation, as well as the advantages and challenges of such annotations. It covers the main annotated corpora for English (the Penn Treebank, the International Corpus of English, and OntoNotes) as well as a wide range of corpora for other languages. The third part explores the search strategies required for different types of data. All chapters are accompanied by exercises and by sections on further reading.

Corpus Linguistics and Linguistically Annotated Corpora

Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
Total Pages: 321
Release: 2014-12-18
Genre: Language Arts & Disciplines
ISBN: 1441119809

Corpus Linguistics and Linguistically Annotated Corpora

Author: Sandra Kuebler
Publisher: Bloomsbury Publishing
Total Pages: 321
Release: 2015-02-12
Genre: Language Arts & Disciplines
ISBN: 1441116753

Introduces corpus linguistics with a focus on linguistically annotated corpora, enabling analysis of a wide range of linguistic phenomena.

Language Corpora Annotation and Processing

Author: Niladri Sekhar Dash
Publisher: Springer Nature
Total Pages:
Release: 2021
Genre: Computational linguistics
ISBN: 9811629609

This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.

Corpus Annotation

Author: Roger Garside
Publisher: Routledge
Total Pages: 304
Release: 1997
Genre: Computers
ISBN:

This text surveys the growing field of research known as corpus annotation, the practice of adding linguistic information to a corpus (an electronic collection of texts). Annotated corpora are a central resource in linguistics, information technology and the processing of human language. The book seeks to show the nature of language and the most effective means of analysing it. A bibliography lists relevant e-mail addresses and Web sites.

An Introduction to Corpus Linguistics

Author: Graeme Kennedy
Publisher: Routledge
Total Pages: 328
Release: 2014-09-19
Genre: Language Arts & Disciplines
ISBN: 1317892585

The use of large, computerized bodies of text for linguistic analysis and description has emerged in recent years as one of the most significant and rapidly developing fields of activity in the study of language. This book provides a comprehensive introduction and guide to corpus linguistics. All aspects of the field are explored, from the various types of electronic corpora that are available to instructions on how to design and compile a corpus. Graeme Kennedy surveys the development of corpora for use in linguistic research, looking back to the pre-electronic age as well as to the massive growth of computer corpora in the electronic age.

English Corpus Linguistics

Author: Charles F. Meyer
Publisher: Cambridge University Press
Total Pages: 211
Release: 2023-06-30
Genre: Language Arts & Disciplines
ISBN: 1009365428

Corpus linguistics is a research method which draws on authentic language examples, collected and organized into 'corpora', or searchable 'bodies' of data. The method was established in the 1960s, and has rapidly developed since then. Now in its second edition, this book provides a step-by-step guide on how to create and analyze linguistic corpora. It has been extensively updated to reflect the most recent developments in this ever-evolving field, and now covers the empirical foundation of corpus-based research, new methodological considerations that guide the creation of a corpus, new kinds of research that can be conducted on corpora, and the most up-to-date information on how qualitative and quantitative analyses of corpora are conducted. Theoretical approaches are introduced in an accessible, easy-to-read way, and the book is illustrated with a wide range of different linguistic corpora, making it essential reading for researchers and students in a number of subfields of linguistics.

Spoken Corpora and Linguistic Studies

Author: Tommaso Raso
Publisher: John Benjamins Publishing Company
Total Pages: 508
Release: 2014-11-14
Genre: Language Arts & Disciplines
ISBN: 9027270031

The authors of this book share a common interest in the importance of corpus compilation for the empirical study of human language and in the importance of pragmatic categories such as emotion, attitude, illocution and information structure in linguistic theory, as well as a passionate belief in the central role of prosody for the analysis of speech. The book is organized into four sections: spoken corpora compilation; spoken corpora annotation; prosody; and syntax and information structure. Within this structure, the authors present innovative methodologies that focus on the compilation of third-generation spoken corpora and on multilevel spoken corpora annotation and its functions, and they open a debate about the reference unit in the study of spoken language via information structure. The book is accompanied by a web site with a rich array of audio/video files, available at DOI: 10.1075/scl.61.media

Corpus Analysis

Author:
Publisher: BRILL
Total Pages: 294
Release: 2016-08-09
Genre: Language Arts & Disciplines
ISBN: 9004334416

The papers published in this volume were originally presented at the Third North American Symposium on Corpus Linguistics and Language Teaching, held on 23-25 March 2001 at the Park Plaza Hotel in Boston, Massachusetts. Each paper analyses some aspect of language use or structure in one or more of the many linguistic corpora now available. The number of different corpora investigated in the book is a real testament to the progress that has been made in recent years in developing new corpora, particularly spoken corpora: over half of the papers deal either wholly or partially with the analysis of spoken data. This book will be of particular interest to undergraduate and graduate students and scholars interested in corpus linguistics, sociolinguistics, applied linguistics, discourse analysis, pragmatics, and language teaching.

Developing Linguistic Corpora

Author: Martin Wynne
Publisher: Oxbow Books Limited
Total Pages: 100
Release: 2005
Genre: Language Arts & Disciplines
ISBN:

A linguistic corpus is a collection of texts which have been selected and brought together so that language can be studied on the computer. Today, corpus linguistics offers some of the most powerful new procedures for the analysis of language, and the impact of this dynamic and expanding sub-discipline is making itself felt in many areas of language study. In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. This guide is aimed at those who are at some stage of building a linguistic corpus. Little or no knowledge of corpus linguistics or computational procedures is assumed, although it is hoped that more advanced users will find the guidelines here useful. It is also aimed at those who are not building a corpus, but who need to know something about the issues involved in the design of corpora in order to choose between available resources and to help draw conclusions from their studies.