Speech Technology Progress Based on New Machine Learning Paradigm-Reference-Cited by-同舟云学术

Speech Technology Progress Based on New Machine Learning Paradigm

Published:2019-06-25 Issue: Volume:2019 Page:1-19
ISSN:1687-5265
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Delić Vlado¹^ORCID,Perić Zoran²^ORCID,Sečujski Milan¹^ORCID,Jakovljević Nikša¹^ORCID,Nikolić Jelena²^ORCID,Mišković Dragiša¹^ORCID,Simić Nikola²^ORCID,Suzić Siniša¹^ORCID,Delić Tijana¹^ORCID

Affiliation:

1. University of Novi Sad, Faculty of Technical Sciences, 21000 Novi Sad, Serbia

2. University of Niš, Faculty of Electronic Engineering, 18000 Niš, Serbia

Abstract

Speech technologies have been developed for decades as a typical signal processing area, while the last decade has brought a huge progress based on new machine learning paradigms. Owing not only to their intrinsic complexity but also to their relation with cognitive sciences, speech technologies are now viewed as a prime example of interdisciplinary knowledge area. This review article on speech signal analysis and processing, corresponding machine learning algorithms, and applied computational intelligence aims to give an insight into several fields, covering speech production and auditory perception, cognitive aspects of speech communication and language understanding, both speech recognition and text-to-speech synthesis in more details, and consequently the main directions in development of spoken dialogue systems. Additionally, the article discusses the concepts and recent advances in speech signal compression, coding, and transmission, including cognitive speech coding. To conclude, the main intention of this article is to highlight recent achievements and challenges based on new machine learning paradigms that, over the last decade, had an immense impact in the field of speech signal processing.

Funder

Ministarstvo Prosvete, Nauke i Tehnološkog Razvoja

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2019/4368036.pdf

Reference89 articles.

1. Re-creating the sigsaly quantizer: This 1943 analog-to-digital converter gave the allies an unbreakable scrambler - [Resources]

2. Digital coding of waveforms. Principles and applications to speech and video

Cited by 39 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Drinkers Voice Recognition Intelligent System: An Ensemble Stacking Machine Learning Approach;Annals of Data Science;2024-07-07

2. Natural Language Processing for Recognizing Bangla Speech with Regular and Regional Dialects: A Survey of Algorithms and Approaches;2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC);2024-07-02

3. Artificial Intelligence's Role and Future Implementation in Education;Advances in Educational Technologies and Instructional Design;2024-06-14

4. Parkinson's Disease Diagnosis Using Voice Features and Effective Machine Learning Methods;Advances in Medical Technologies and Clinical Practice;2024-02-23

5. Towards more flexible human-machine speech communication;2023 31st Telecommunications Forum (TELFOR);2023-11-21