Semi-Supervised Deep Time-Delay Embedded Clustering for Stress Speech Analysis-Reference-Cited by-同舟云学术

Semi-Supervised Deep Time-Delay Embedded Clustering for Stress Speech Analysis

Published:2019-11-01 Issue:11 Volume:8 Page:1263
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Prasetio Barlian Henryranu^ORCID,Tamura Hiroki^ORCID,Tanno Koichi

Abstract

Real stressed speech is affected by various aspects (individual characteristics and environment) so that the stress patterns are diverse and different on each individual. To this end, in our previous work, we performed an unsupervised clustering method that able to self-learning manner by mapping the feature representations of the stress speech and clustering tasks simultaneously, called deep time-delay embedded clustering (DTEC). However, DTEC has not confirmed yet the compatibility between the output class and informational classes. Therefore, we proposed semi-supervised time-delay embedded clustering (SDTEC) as a new framework of semi-supervised in DTEC. SDTEC incorporates the prior information of pairwise constraints in the embedding layer and simultaneously learns the feature representation and the clustering assignments. The prior information was used to guide the clustering procedure so that the points that belong to the incorrect cluster can be corrected. The effectiveness of the proposed SDTEC was evaluated by comparing it with some baseline methods in terms of the clustering error rate (CER). Moreover, to demonstrate SDTEC’s capabilities, we conducted a comprehensive ablation study. Based on experiment results, SDTEC outperformed the baseline methods and achieves state-of-the-art results in semi-supervised clustering.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/8/11/1263/pdf

Reference41 articles.

1. Unconscious emotion: A cognitive neuroscientific perspective

2. Autonomic and endocrine control of cardiovascular function

3. Speech Under Stress: Analysis, Modeling and Recognition;Hansen,2007

4. Mechanics of human voice production and control

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Stressed Speech Recognition Using Smartphone and Embedded Device Integration;Proceedings of the 8th International Conference on Sustainable Information Engineering and Technology;2023-10-24

2. An Accelerator for Semi-Supervised Classification with Granulation Selection;Electronics;2023-05-15

3. A Study of Stressed Facial Recognition Based on Histogram Information;Informatica;2022-06-15

4. A Novel Neural Network-Based Approach to Classification of Implicit Emotional Components in Ordinary Speech;Optical Memory and Neural Networks;2021-01

5. Deep time-delay Markov network for prediction and modeling the stress and emotions state transition;Scientific Reports;2020-10-22