VOICE: Vienna-Oxford International Corpus of English
| Title | VOICE: Vienna-Oxford International Corpus of English |
| Author | Barbara Seidlhofer ; Angelika Breiteneder; Theresa Klimpfinger; Stefan Majewski; Marie-Luise Pitzl |
| Availability | This resource is freely available, you should be able to download it now. |
| Languages | English |
| Editorial Practice | Encoding format: TEI P5 XML |
| OTA keywords |
Linguistic corpora Corpus |
| LC keywords | |
| Extent |
|
| Creation Date | The corpus data were collected between 2001 and 2007, and compiled 2005 to 2009. VOICE Online was published in 2009, and VOICE XML in 2011. |
| Source Description |
: VOICE is based on audio-recordings of 151 naturally-occurring, non-scripted, face-to-face interactions involving 753 identified individuals from 49 different first language backgrounds using English as a lingua franca (ELF), i.e. English used as a common means of communication among speakers from different first-language backgrounds. The recordings were carried out between July 2001 and November 2007, usually using portable mini-disc recorders with external microphones. Most of the audio recordings are supplemented by detailed field notes including information about the nature of the speech event and the interaction taking place as well as about the participants engaging in these ELF interactions. The interactions recorded are complete speech events from different domains (educational, leisure, professional) and of different speech event types (conversation, interview, meeting, panel, press conference, question-answer session, seminar discussion, service encounter, working group discussion, workshop discussion). The audio-recordings were transcribed, checked and proof-read by trained transcribers and researchers in accordance with the VOICE mark-up and spelling conventions [2.1] (see http://www.univie.ac.at/voice/page/transcription_general_information). Details for each electronic text are given in the individual text headers. The principles and practices underlying the selection an |
| Notes | The primary language of the corpus is English as a lingua franca, with some switches to other languages. The corpus consists of manual transcriptions of audio recordings of speech. |
