Building a National Corpus: A Welsh Language Case Study

Dawn Knight, Steve Morris, Laura Arman, Jennifer Needs, Mair Rees

Swansea University Authors: Steve Morris, Jennifer Needs, Mair Rees

DOI (Published version): 10.1007/978-3-030-81858-6

Published in: Building a National Corpus: A Welsh Language Case Study
ISBN: 9783030818579, 9783030818586
Published: Cham: Springer International Publishing, 2021
URI: https://cronfa.swan.ac.uk/Record/cronfa58387
Abstract: This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes, the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and on the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches used in building CorCenCC can be applied, to a greater or lesser extent, to other language contexts. The book provides a working model and an account of how to build a corpus dataset, from which step-by-step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.
Keywords: corpus linguistics; minority languages; Welsh language; Welsh linguistics; CorCenCC; national corpora; e-language; spoken data; written data
College: Faculty of Humanities and Social Sciences
Funders: ESRC/AHRC