D-LUCEA: Curation of the UCU Accent Project Data
Rosemary Orr, Hugo Quené
Chapter from the book: Odijk J. & van Hessen A. 2017. CLARIN in the Low Countries.
The UCU Accent Project was set up in 2010 to collect a wide variety of native and non-native accents of English in an environment where English is the lingua franca: an international liberal arts and sciences college in Utrecht, the Netherlands. Recordings were made longitudinally over the three years of undergraduate study, and four cohorts of students were recorded in total. This yielded over 1,000 speech recordings across a six-year period, in which the development of both native and non-native English accents in a non-native environment can be examined. To facilitate sharing the data with the wider research community, the D-LUCEA project undertook to curate it. For each recording, accompanying metadata was produced, giving users of the database information about the speaker, the technical specifications, the kinds of speech material recorded, and so forth. The project was funded by CLARIN, and specific CLARIN curation tools were made available to us, including the Component Metadata Infrastructure (CMDI). To date, all of the speech data has been processed so that the metadata is available, and research on the corpus is already under way, on topics as varied as prosodic convergence, L1 phonetic drift and phone convergence. Further plans include work on speaker recognition, accent recognition and models of language learning such as Flege’s Speech Learning Model, the Critical Period Hypothesis and the Perceptual Assimilation Model.
Orr R. & Quené H. 2017. D-LUCEA: Curation of the UCU Accent Project Data. In: Odijk J. & van Hessen A (eds.), CLARIN in the Low Countries. London: Ubiquity Press. DOI: https://doi.org/10.5334/bbi.15
This is an Open Access chapter distributed under the terms of the Creative Commons Attribution 4.0 license (unless stated otherwise), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. Copyright is retained by the author(s).
This book has been peer reviewed.
Published on Dec. 28, 2017