OLAC Record
oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051

Metadata
Title:KIParla - ParlaTO transcripts
Bibliographic Citation:http://hdl.handle.net/20.500.11752/OPEN-1051
Creator:Ballarè, Silvia
Cerruti, Massimo
Date (W3CDTF):2025-10-07T05:24:46Z
Date Available:2025-10-07T05:24:46Z
Description:The ParlaTO corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The ParlaTO corpus was funded by the CRT Foundation ("ParlaTO - Corpus del Parlato di Torino" project). It consists of about 50 hours of interactions collected in Turin and its province through semi-structured interviews. The interviews, conducted between 2018 and 2020, involved 88 speakers of different origins, ages, education levels, and types of occupation, and addressed personal life experiences in the city (study, work, leisure activities, retirement, memories of the past, etc.). The transcriptions have been anonymized. Overall, the module is made up of 68 conversations and includes 100 speakers.
This repository contains:
• metadata for both speakers (occupation, gender, age, origin, L1, educational achievement) and conversations (collection point, year, languages used), in the metadata subfolder
• descriptions of the transcription conventions used for this module
• for each conversation: an .eaf file in the eaf/ folder (time-aligned Jefferson-style transcription); a .txt file in the linear-jefferson/ folder (linearized Jefferson-style transcription); a .txt file in the linear-orthographic/ folder (linearized transcription retaining only orthographic words); a .tsv file in the tsv/ folder (tokenised version of the transcription)
More information can be found in the README.md file.
Due to GDPR restrictions, pseudo-anonymized audio files (MP3) are available only under a restricted-access license. To request access, please contact the corpus coordinators through the KIParla website and follow the procedure provided there.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Identifier (URI):http://hdl.handle.net/20.500.11752/OPEN-1051
Language:Italian
Piemontese
Language (ISO639):ita
pms
Publisher:Università degli studi di Torino
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:face-to-face interviews
human-human spoken dialogues
spontaneous speech
spoken Italian
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text
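
The per-conversation layout listed in the Description can be sketched as a small path helper. This is a minimal illustration only: the folder names and extensions come from the record, but the idea that all four files share the conversation identifier as their file stem, the function name conversation_files, and the sample root and identifier are assumptions; the README.md in the repository is the authority on actual file naming.

```python
from pathlib import Path

# Folder name and file extension for each transcription format,
# as listed in the record's Description field.
LAYOUT = {
    "eaf": ("eaf", ".eaf"),                                # time-aligned Jefferson-style
    "linear_jefferson": ("linear-jefferson", ".txt"),      # linearized Jefferson-style
    "linear_orthographic": ("linear-orthographic", ".txt"),  # orthographic words only
    "tsv": ("tsv", ".tsv"),                                # tokenised version
}


def conversation_files(root, conversation_id):
    """Return the expected path of each format for one conversation.

    Assumption: every format uses the conversation identifier as the
    file stem. Nothing is read from disk; paths are only constructed.
    """
    root = Path(root)
    return {
        kind: root / folder / f"{conversation_id}{ext}"
        for kind, (folder, ext) in LAYOUT.items()
    }


# Hypothetical root folder and conversation identifier, for illustration.
paths = conversation_files("ParlaTO", "CONV001")
for kind, path in paths.items():
    print(f"{kind}: {path.as_posix()}")
```

Building paths without touching the file system keeps the helper usable before the archive has been downloaded; a caller can then check each path with Path.exists().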

OLAC Info

Archive:  ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa
Description:  http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051
DateStamp:  2025-10-07
GetRecord:  OAI-PMH request for simple DC format
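
The GetRecord requests listed above follow the standard OAI-PMH query pattern (verb, identifier, metadataPrefix). The sketch below only builds such request URLs. The base URL is a placeholder, since this record does not state the repository's OAI-PMH endpoint, and the function name build_get_record_url is hypothetical. The prefix oai_dc is the one OAI-PMH requires every repository to support; olac is the prefix conventionally used for the OLAC format and is assumed here.

```python
from urllib.parse import urlencode


def build_get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord request URL (no network access)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Placeholder endpoint: an assumption, not the repository's real base URL.
BASE_URL = "https://example.org/oai/request"
# OAI identifier taken from this record.
IDENTIFIER = "oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051"

olac_url = build_get_record_url(BASE_URL, IDENTIFIER, "olac")
dc_url = build_get_record_url(BASE_URL, IDENTIFIER, "oai_dc")
print(olac_url)
print(dc_url)
```

The identifier is percent-encoded by urlencode, so the colons and the slash in the OAI identifier are safe to pass as a query parameter.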

Search Info

Citation: Ballarè, Silvia; Cerruti, Massimo. 2025. Università degli studi di Torino.
Terms: area_Europe country_IT dcmi_Text iso639_ita iso639_pms olac_primary_text


http://www.language-archives.org/item.php/oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051
Up-to-date as of: Wed Oct 8 0:33:10 EDT 2025