applsci-logo

Journal Browser

Journal Browser

Automatic Speech Signal Processing

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Acoustics and Vibrations".

Deadline for manuscript submissions: closed (29 February 2024) | Viewed by 4025

Special Issue Editor


E-Mail Website
Guest Editor
Department of Electronics and Information Engineering, Beihang University, Beijing 100191, China
Interests: affective computing; pattern recognition; human–computer interaction
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Speech signals are an important medium for Human-to-Computer Interactions. Speech has the advantages of easy access, a small amount of data, a non-line-of-sight effect and other advantages. It is also vulnerable to interference, strong information coupling and other challenges. Major progress is being published regularly on both the technology and exploitation of Automatic Speech Signal Processing (ASSP). However, there are still technological barriers to flexible solutions and user satisfaction under some circumstances. This is related to several factors, such as sensitivity to the environment (background noise) or the weak representation of grammatical and semantic knowledge. Current research also emphasizes deficiencies in dealing with variation naturally present in speech. For instance, the lack of robustness to foreign accents precludes use by specific populations. We are interested in articles that explore robust ASSP systems. Potential topics include but are not limited to the following:

  • Automatic speech segmentation and phoneme detection;
  • Automatic speech recognition with noised speech;
  • Automatic speech translation;
  • Automatic speech synthesis;
  • Automatic classification of emotions in speech;
  • Multi-modal speech recognition with video or physiological signals.

Dr. Lijiang Chen
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue polices can be found here.

Published Papers (2 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

19 pages, 4712 KiB  
Article
Analysis and Investigation of Speaker Identification Problems Using Deep Learning Networks and the YOHO English Speech Dataset
by Nourah M. Almarshady, Adal A. Alashban and Yousef A. Alotaibi
Appl. Sci. 2023, 13(17), 9567; https://doi.org/10.3390/app13179567 - 24 Aug 2023
Cited by 4 | Viewed by 2199
Abstract
The rapid momentum of deep neural networks (DNNs) in recent years has yielded state-of-the-art performance in various machine-learning tasks using speaker identification systems. Speaker identification is based on the speech signals and the features that can be extracted from them. In this article, [...] Read more.
The rapid momentum of deep neural networks (DNNs) in recent years has yielded state-of-the-art performance in various machine-learning tasks using speaker identification systems. Speaker identification is based on the speech signals and the features that can be extracted from them. In this article, we proposed a speaker identification system using the developed DNNs models. The system is based on the acoustic and prosodic features of the speech signal, such as pitch frequency (vocal cords vibration rate), energy (loudness of speech), their derivations, and any additional acoustic and prosodic features. Additionally, the article investigates the existing recurrent neural networks (RNNs) models and adapts them to design a speaker identification system using the public YOHO LDC dataset. The average accuracy of the system was 91.93% in the best experiment for speaker identification. Furthermore, this paper helps uncover reasons for analyzing speakers and tokens yielding major errors to increase the system’s robustness regarding feature selection and system tune-up. Full article
(This article belongs to the Special Issue Automatic Speech Signal Processing)
Show Figures

Figure 1

15 pages, 473 KiB  
Article
Crossband Filtering for Weighted Prediction Error-Based Speech Dereverberation
by Tomer Rosenbaum, Israel Cohen and Emil Winebrand
Appl. Sci. 2023, 13(17), 9537; https://doi.org/10.3390/app13179537 - 23 Aug 2023
Cited by 1 | Viewed by 1141
Abstract
Weighted prediction error (WPE) is a linear prediction-based method extensively used to predict and attenuate the late reverberation component of an observed speech signal. This paper introduces an extended version of the WPE method to enhance the modeling accuracy in the time–frequency domain [...] Read more.
Weighted prediction error (WPE) is a linear prediction-based method extensively used to predict and attenuate the late reverberation component of an observed speech signal. This paper introduces an extended version of the WPE method to enhance the modeling accuracy in the time–frequency domain by incorporating crossband filters. Two approaches to extending the WPE while considering crossband filters are proposed and investigated. The first approach improves the model’s accuracy. However, it increases the computational complexity, while the second approach maintains the same computational complexity as the conventional WPE while still achieving improved accuracy and comparable performance to the first approach. To validate the effectiveness of the proposed methods, extensive simulations are conducted. The experimental results demonstrate that both methods outperform the conventional WPE regarding dereverberation performance. These findings highlight the potential of incorporating crossband filters in improving the accuracy and efficacy of the WPE method for dereverberation tasks. Full article
(This article belongs to the Special Issue Automatic Speech Signal Processing)
Show Figures

Figure 1

Back to TopTop