OLAC Record
oai:catalogue.elra.info:ELRA-S0396

Metadata
Title:Mbochi speech corpus
Abstract:This corpus consists of 5131 sentences recorded in mbochi, together with their transcription and French translation, as well as the results from the work made during JSALT workshop: alignments at the phonetic level and various results of unsupervised word segmentation from audio. The audio corpus is made up of 4,5 hours, downsampled at 16kHz, 16bits, with Linear PCM encoding. Data is distributed into 2 parts, one for training consisting of 4617 sentences, and one for development consisting of 514 sentences.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2018-05-31
Date Issued (W3CDTF):2018-05-31
Date Modified (W3CDTF):2018-05-31
Description:Desktop/Microphone
The Mbochi speech corpus was developed in the framework of ANR-DFG BULB project. This project aims to provide field linguists (eg working on morphology) with tools for less or not written languages. The provided corpus is a subset from the corpus developed in this framework. The provided corpus consists of 5131 sentences recorded in mbochi, together with their transcription and French translation, as well as the results from the work made during JSALT workshop (within one of the topics which was "the speaking Rosetta stone - Discovering Grounded Linguistic Units for Languages without Orthography"): alignments at the phonetic level and various results of unsupervised word segmentation from audio. The audio corpus is made up of 4,5 hours, downsampled at 16kHz, 16bits, with Linear PCM encoding. Data is distributed into 2 parts, one for training consisting of 4617 sentences, and one for development consisting of 514 sentences.
Identifier:ELRA-S0396
http://catalog.elra.info/product_info.php?products_id=1320
Language:Achinese
French
Language (ISO639):ace
fra
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0396
DateStamp:  2018-05-31
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2018. ELRA (European Language Resources Association).
Terms: area_Asia area_Europe country_FR country_ID dcmi_Sound iso639_ace iso639_fra olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0396
Up-to-date as of: Wed Oct 2 8:22:53 EDT 2019