OLAC Record

Title:JV_TDM Corpus
Access Rights: Rights available for: attribution
Date Available (W3CDTF):2016-01-05
Date Issued (W3CDTF):2016-01-05
Date Modified (W3CDTF):2016-01-19
Description:The JV_TDM corpus provides a phonetic annotation of 37 chapters of the original French version of “Around the World in 80 Days” by Jules Verne read by a single speaker. Each chapter has been annotated in a separate .TextGrid file. The audio files are not included in this release. They are available under a CC BY-NC-SA licence on the site www.litteratureaudio.com (www.litteratureaudio.com/livre-audio-gratuit-mp3/jules-verne-le-tour-du-monde-en-80-jours.html).The total audio size is 6h 41mn 36s with 5h 2mn 41s of speech. In the JV_TDM corpus, the speaker uttered 78,876 words at an average speed of 5.82 syllables and 13.49 phones per second. The speaker produced 244,908 phones and 11,352 pauses (short and long). All phonemes except glottal stops and palatal/velar nasals are encountered more than 1000 times.The .TextGrid files contain several annotation tiers: phoneme, number of alphanumeric characters corresponding to a phone, syllable, transcription, PoS, paragraph break, sentence break, prosodic annotations, breathing pauses.With the text-to-speech system COMPOST, the original text material was first PoS annotated, phonetically transcribed, syllabified and plausible pauses were inserted. Text-to-speech alignment was then performed on paragraphs which were manually delimited with Praat. The segmentation and all the annotations were manually validated.Reference:Bailly, G. & C. Gouvernayre (2012). Pauses and respiratory markers of the structure of book reading. Interspeech. Portland, OR, pp. 2218-2221.
ISLRN: 371-240-320-910-4
Identifier (URI):http://catalog.elra.info/en-us/repository/browse/ELRA-S0379/
Language (ISO639):fra
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0379
DateStamp:  2016-01-05
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2016. ELRA (European Language Resources Association).
Terms: area_Europe country_FR dcmi_Sound iso639_fra olac_primary_text

Up-to-date as of: Wed Nov 17 9:08:54 EST 2021