OLAC Record oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1048 |
Metadata | ||
Title: | KIParla - KIP transcripts | |
Bibliographic Citation: | http://hdl.handle.net/20.500.11752/OPEN-1048 | |
Creator: | Ballarè, Silvia | |
Goria, Eugenio | ||
Mauri, Caterina | ||
Date (W3CDTF): | 2025-10-07T05:23:40Z | |
Date Available: | 2025-10-07T05:23:40Z | |
Description: | The KIP corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The KIP corpus was compiled within the framework of the LEAdhoC project – Linguistic Expression of Ad Hoc Categories, funded by the Italian Ministry of Education, University and Research (MIUR) under the SIR 2016 call. It consists of approximately 70 hours of spoken data collected at the Universities of Bologna and Turin. The interactions, recorded between 2016 and 2019, involved over 180 speakers, including university students and professors from various regions of Italy, and took place in five different types of communicative situations: lessons, exams, office hours, semi-structured interviews, and free conversations (among students). The transcriptions have been anonymized. Overall, the module is made up of 121 conversations and includes 184 speakers. This repository contains: (1) metadata for both speakers (age, origin, occupation, gender) and conversations (type of interaction), in the metadata subfolder; (2) descriptions of the set of transcription conventions used for this module (Transcription conventions); (3) transcripts of the recorded conversations in the following formats: .eaf files in the eaf/ folder (time-aligned Jefferson-style transcriptions), .txt files in the linear-jefferson/ folder (linearized Jefferson-style transcriptions), .txt files in the linear-orthographic/ folder (linearized transcriptions retaining only orthographic words), and .tsv files in the tsv/ folder (tokenised version of the transcriptions). More information can be found in the README.md file. Due to GDPR restrictions, pseudo-anonymized audio files (MP3) are available under a restricted-access license. To request access, please contact the corpus coordinators through the KIParla website and follow the provided procedure. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. | |
Identifier (URI): | http://hdl.handle.net/20.500.11752/OPEN-1048 | |
Language: | Italian | |
Language (ISO639): | ita | |
Publisher: | Alma Mater Studiorum – Università di Bologna | |
Rights: | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | |
http://creativecommons.org/licenses/by-nc-sa/4.0/ | ||
Subject: | human-human spoken dialogues | |
Corpus | ||
Italian Language | ||
spontaneous speech | ||
Type: | corpus | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info | ||
Archive: | ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa | |
Description: | http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info | ||
OaiIdentifier: | oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1048 | |
DateStamp: | 2025-10-07 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Ballarè, Silvia; Goria, Eugenio; Mauri, Caterina. 2025. Alma Mater Studiorum – Università di Bologna. | |
Terms: | area_Europe country_IT dcmi_Text iso639_ita olac_primary_text |
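
The GetRecord entries above refer to OAI-PMH requests for this record. As a rough illustration only, the sketch below shows how such a request URL could be assembled from the OaiIdentifier listed in this record. The base URL is a placeholder, because the record does not state the repository's OAI-PMH endpoint. `oai_dc` is the simple Dublin Core prefix that OAI-PMH requires every repository to support; using `olac` as the prefix for the OLAC format is an assumption about this repository's configuration. The helper name `get_record_url` is hypothetical.

```python
from urllib.parse import urlencode

# Placeholder endpoint (assumption): this record does not give the
# repository's actual OAI-PMH base URL.
BASE_URL = "https://example.org/oai/request"

# Identifier copied from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1048"


def get_record_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Build an OAI-PMH GetRecord request URL (hypothetical helper)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Simple Dublin Core format (prefix mandated by the OAI-PMH specification).
dc_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")

# OLAC format; the prefix "olac" is assumed, not confirmed by this record.
olac_url = get_record_url(BASE_URL, OAI_IDENTIFIER, "olac")

print(dc_url)
print(olac_url)
```

The identifier is percent-encoded by `urlencode`, so the colons and the slash in it do not interfere with the query string. Replace `BASE_URL` with the repository's real endpoint before sending any request.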