Logo Logo
Hilfe
Hilfe
Switch Language to English

Draxler, Christoph und Schiel, Florian (2002): Three New Corpora at the Bavarian Archive for Speech Signals - and a First Step Towards Distributed Web-Based Recording. Third International Conference on Language Resources and Evaluation (LREC) 2002, Las Palmas, Spanien, 29. - 31. Mai 2002. Gonzáles Rodriguez, Manual (Hrsg.): In: LREC 2002 Third International Conference on Language Resources and Evaluation, proceedings. Las Palmas de Gran Canaria, 29th, 30th & 31st May 2002, Bd. 1 Paris: S. 21-24 [PDF, 197kB]

[thumbnail of schiel_13679.pdf]
Vorschau
Download (197kB)

Abstract

The Bavarian Archive for Speech Signals has released three new speech corpora for both industrial and academic use: a) Hempels Sofa contains recordings of up to 60 seconds of non-scripted telephone speech, b) ZipTel is a corpus with telephone speech covering postal addresses and telephone numbers from a real world application, and c) RVG-J, an extension of the original Regional Variants of German corpus with juvenile speakers. All three corpora were transcribed orthographically according to the SpeechDat annotation guidelines using the WWWTranscribe annotation software. Recently, BAS has begun to investigate performing large-scale audio recordings via the web, and RVG-J has become the testbed for this type of recording.

Dokument bearbeiten Dokument bearbeiten