HuMenDisCo: A Hungarian Speech Corpus of Schizophrenia, Schizoaffective and Bipolar Disorders

Author:

Szabó Martina Katalin1,Vincze Veronika2,Guba Csenge3,Dam Bernadett4,Solymos Adrienn5,Bagi Anita5,Szendi István6

Affiliation:

1. Institute of Global Studies, Tokyo University of Foreign Studies

2. ELKH-SZTE Research Group on Artificial Intelligence

3. Doctoral School of Linguistics, University of Szeged

4. Department of General Linguistics, University of Szeged

5. Department of Hungarian Linguistics, University of Szeged

6. Psychiatry Unit, Kiskunhalas Semmelweis Hospital, University Teaching Hospital

Abstract

AbstractHere we present a Hungarian corpus of spontaneous speech texts produced by patients with schizophrenia, schizoaffective or bipolar disorder, as well as those of healthy controls. Recordings which were later transcribed were produced in three different directed spontaneous speech tasks in a clinical environment. The survey was carried out involving 90 subjects and 526 texts were produced. Then, the collected recordings were manually transcribed by our research group. The written corpus texts were processed with a set of Natural Language Processing methods and tools. The final corpus consists of 158,386 tokens all together, without punctuation. During the data processing procedure, we also applied specific lexicons to enable us to examine linguistic intensification in the case of mental disorders. The dataset can be utilized in several related research tasks, like semantic-pragmatic analyses and in the automatic discrimination of the patients and the controls using our linguistic features.

Publisher

Research Square Platform LLC

Reference57 articles.

1. On the subjectivity of intensifiers;Athanasiadou A;Language sciences,2007

2. Bagi, A., Gosztolya, G., Szalóki, S., Szendi, I., & Hoffmann, I. (2019). Szkizofrénia azonosítása spontán beszéd temporális paraméterei alapján – egy pilot kutatás eredményei [Identifying schizophrenia based on temporal parameters in spontaneous speech – Results of a pilot study]. In B. Gábor, G. Gábor, & V. Veronika (Eds.), XV. Magyar Számítógépes Nyelvészeti Konferencia, pp. 189–201. Szegedi Tudományegyetem, Informatikai Intézet.

3. The etiology of schizophrenia and the origin of language: overview of a theory;Berlim MT;Comprehensive psychiatry,2003

4. Bickerton, D. (1990). Language and Species. Chicago, IL: University of Chicago.

5. Bickerton, D. (1995). Language and Human Behaviour. Seattle, WA: University of Washington.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3