OLAC Record
oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1048

Metadata
Title:KIParla - KIP transcripts
Bibliographic Citation:http://hdl.handle.net/20.500.11752/OPEN-1048
Creator:Ballarè, Silvia
Goria, Eugenio
Mauri, Caterina
Date (W3CDTF):2025-10-07T05:23:40Z
Date Available:2025-10-07T05:23:40Z
Description:The KIP corpus is part of the larger KIParla collection (www.kiparla.it), which can be freely queried through the NoSketch Engine interface. The KIP corpus was compiled within the framework of the LEAdhoC project – Linguistic Expression of Ad Hoc Categories, funded by the Italian Ministry of Education, University and Research (MIUR) under the SIR 2016 call. It consists of approximately 70 hours of spoken data collected at the Universities of Bologna and Turin. The interactions, recorded between 2016 and 2019, involved over 180 speakers, including university students and professors from various regions of Italy, and took place in five different types of communicative situations: lessons, exams, office hours, semi-structured interviews, free conversations (among students). The transcriptions have been anonymized. Overall, the module is made up of 121 conversations and includes 184 speakers. This repository contains: - metadata for both speakers (age, origin, occupation, gender) and conversations (type of interaction), in the metadata subfolder - descriptions of the set of transcription conventions used for this module (Transcription conventions) - transcripts of the recorded conversations in the following formats: .eaf file in eaf/ folder (time-aligned Jefferson-style transcriptions) .txt file in linear-jefferson/ folder (linearized Jefferson-style transcription) .txt file in linear-orthographic/ folder (linearized transcription retaining only orthographic words) .tsv file in tsv/ folder (tokenised version of the transcription) More information can be found in the README.md file. Due to GDPR restrictions, pseudo-anonymized audio files (MP3) are available under a restricted-access license. To request access, please contact the corpus coordinators through the KIParla website and follow the provided procedure. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Identifier (URI):http://hdl.handle.net/20.500.11752/OPEN-1048
Language:Italian
Language (ISO639):ita
Publisher:Alma Mater Studiorum – Università di Bologna
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:human-human spoken dialogues
Corpus
Italian Language
spontaneous speech
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa
Description:  http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1048
DateStamp:  2025-10-07
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Ballarè, Silvia; Goria, Eugenio; Mauri, Caterina. 2025. Alma Mater Studiorum – Università di Bologna.
Terms: area_Europe country_IT dcmi_Text iso639_ita olac_primary_text


http://www.language-archives.org/item.php/oai:dspace-clarin-it.ilc.cnr.it:000-c0-111/1048
Up-to-date as of: Wed Oct 8 0:33:10 EDT 2025