OLAC Record oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051 |
Metadata | ||
Title: | KIParla - ParlaTO transcripts | |
Bibliographic Citation: | http://hdl.handle.net/20.500.11752/OPEN-1051 | |
Creator: | Ballarè, Silvia | |
Cerruti, Massimo | ||
Date (W3CDTF): | 2025-10-07T05:24:46Z | |
Date Available: | 2025-10-07T05:24:46Z | |
Description: | The ParlaTO corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The ParlaTO corpus was funded by the CRT Foundation ("ParlaTO - Corpus del Parlato di Torino" project). It consists of about 50 hours of interactions collected in Turin and its province through semi-structured interviews. The interviews, conducted between 2018 and 2020, involved 88 speakers with different origins, ages, education levels, and types of occupation, and addressed personal life experiences in the city (study, work, leisure activities, retirement, memories of the past, etc.). The transcriptions have been anonymized. Overall, the module is made up of 68 conversations and includes 100 speakers. This repository contains: • metadata for both speakers (occupation, gender, age, origin, L1, educational achievement) and conversations (collection point, year, languages used), in the metadata subfolder • descriptions of the set of transcription conventions used for this module • for each conversation: an .eaf file in the eaf/ folder (time-aligned Jefferson-style transcription); a .txt file in the linear-jefferson/ folder (linearized Jefferson-style transcription); a .txt file in the linear-orthographic/ folder (linearized transcription retaining only orthographic words); a .tsv file in the tsv/ folder (tokenised version of the transcription). More information can be found in the README.md file. Due to GDPR restrictions, pseudo-anonymized audio files (MP3) are available under a restricted-access license. To request access, please contact the corpus coordinators through the KIParla website and follow the provided procedure. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. | |
Identifier (URI): | http://hdl.handle.net/20.500.11752/OPEN-1051 | |
Language: | Italian | |
Piemontese | ||
Language (ISO639): | ita | |
pms | ||
Publisher: | Università degli studi di Torino | |
Rights: | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | |
http://creativecommons.org/licenses/by-nc-sa/4.0/ | ||
Subject: | face-to-face interviews | |
human-human spoken dialogues | ||
spontaneous speech | ||
spoken Italian | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa | |
Description: | http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051 | |
DateStamp: | 2025-10-07 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Ballarè, Silvia; Cerruti, Massimo. 2025. Università degli studi di Torino. | |
Terms: | area_Europe country_IT dcmi_Text iso639_ita iso639_pms olac_primary_text |
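
The GetRecord entries above refer to OAI-PMH requests for this record in OLAC and simple DC formats. As a minimal sketch of how such a request URL is typically assembled, the snippet below builds the query string from the standard OAI-PMH arguments (`verb`, `identifier`, `metadataPrefix`). The base URL is a hypothetical placeholder, since the record does not state the repository endpoint, and the `olac` and `oai_dc` prefixes are assumed to be the ones the repository exposes.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the record does not give the repository's OAI-PMH base URL.
BASE_URL = "https://example.org/oai/request"

# Identifier copied from the OaiIdentifier field of this record.
OAI_ID = "oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1051"


def get_record_url(identifier: str, metadata_prefix: str, base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for one record and one metadata format."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed prefixes: "olac" for the OLAC format, "oai_dc" for simple Dublin Core.
olac_url = get_record_url(OAI_ID, "olac")
dc_url = get_record_url(OAI_ID, "oai_dc")

print(olac_url)
print(dc_url)
```

Replacing `BASE_URL` with the repository's actual endpoint should be the only change needed before fetching the XML with any HTTP client.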