452 Artículos

« Anterior Página: 1 de 23 Siguiente »

Intelligibility of English Mosaic Speech: Comparison between Native and Non-Native Speakers of English

Acceso

en línea

Santi, Yoshitaka Nakajima, Kazuo Ueda and Gerard B. Remijn

Mosaic speech is degraded speech that is segmented into time × frequency blocks. Earlier research with Japanese mosaic speech has shown that its intelligibility is almost perfect for mosaic block durations (MBD) up to 40 ms. The purpose of the present st... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 19 Año: 2020

Speech Compression

Acceso

en línea

Jerry D. Gibson

Speech compression is a key technology underlying digital cellular communications, VoIP, voicemail, and voice response systems. We trace the evolution of speech coding based on the linear prediction model, highlight the key milestones in speech coding, a... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 7 Num: 0 Par: 2 Año: 2016

MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement

Acceso

en línea

Rui Cheng, Changchun Bao and Zihao Cui

The proposed MASS can simulate multiple signals collected by microphone array in room acoustic environment for multi-channel speech coding and enhancement.

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 4 Año: 2020

The Usefulness of Imperfect Speech Data for ASR Development in Low-Resource Languages

Acceso

en línea

Jaco Badenhorst and Febe de Wet

When the National Centre for Human Language Technology (NCHLT) Speech corpus was released, it created various opportunities for speech technology development in the 11 official, but critically under-resourced, languages of South Africa. Since then, the s... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 9 Año: 2019

Technologies and Applications Review of Subvocal Speech

Acceso

en línea

Olga Lucia Ramos Sandoval, Erika Nathalia Gamma Melo, Dario Amaya Hurtado Pág. 287 - 298

Abstract AuthorsDownloadsReferencesHow to Cite

Revista: Ingeniería Formato: Electrónico

Tabla de contenido: Vol: 20 Num: 2 Par: 0 Año: 1967

Multilingual Speech Recognition for Turkic Languages

Acceso

en línea

Saida Mussakhojayeva, Kaisar Dauletbek, Rustem Yeshpanov and Huseyin Atakan Varol

The primary aim of this study was to contribute to the development of multilingual automatic speech recognition for lower-resourced Turkic languages. Ten languages?Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek?we... ver más

Revista: Information Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 2 Año: 2023

Speaker-Independent Spectral Enhancement for Bone-Conducted Speech

Acceso

en línea

Liangliang Cheng, Yunfeng Dou, Jian Zhou, Huabin Wang and Liang Tao

Because of the acoustic characteristics of bone-conducted (BC) speech, BC speech can be enhanced to better communicate in a complex environment with high noise. Existing BC speech enhancement models have weak spectral recovery capability for the high-fre... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 3 Año: 2023

Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features

Acceso

en línea

Qiang Zhu, Zhong Wang, Yunfeng Dou and Jian Zhou

A conversion method based on the inversion of Mel frequency cepstral coefficient (MFCC) features was proposed to convert whispered speech into normal speech. First, the MFCC features of whispered speech and normal speech were extracted and a matching rel... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 2 Año: 2022

A Preprocessing Strategy for Denoising of Speech Data Based on Speech Segment Detection

Acceso

en línea

Seung-Jun Lee and Hyuk-Yoon Kwon

In this paper, we propose a preprocessing strategy for denoising of speech data based on speech segment detection. A design of computationally efficient speech denoising is necessary to develop a scalable method for large-scale data sets. Furthermore, it... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 20 Año: 2020

Speech Enhancement for Hearing Aids with Deep Learning on Environmental Noises

Acceso

en línea

Gyuseok Park, Woohyeong Cho, Kyu-Sung Kim and Sangmin Lee

Hearing aids are small electronic devices designed to improve hearing for persons with impaired hearing, using sophisticated audio signal processing algorithms and technologies. In general, the speech enhancement algorithms in hearing aids remove the env... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 10 Num: 0 Par: 17 Año: 2020

A Light-Weight Autoregressive CNN-Based Frame Level Transducer Decoder for End-to-End ASR

Acceso

en línea

Hyeon-Kyu Noh and Hong-June Park

A convolutional neural network (CNN) transducer decoder was proposed to reduce the decoding time of an end-to-end automatic speech recognition (ASR) system while maintaining accuracy. The CNN of 177 k parameters and a kernel size of 6 generates the proba... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 14 Num: 0 Par: 3 Año: 2024

Speech Inpainting Based on Multi-Layer Long Short-Term Memory Networks

Acceso

en línea

Haohan Shi, Xiyu Shi and Safak Dogan

Audio inpainting plays an important role in addressing incomplete, damaged, or missing audio signals, contributing to improved quality of service and overall user experience in multimedia communications over the Internet and mobile networks. This paper p... ver más

Revista: Future Internet Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 2 Año: 2024

The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters

Acceso

en línea

Nurgali Kadyrbek, Madina Mansurova, Adai Shomanov and Gaukhar Makharova

This study is devoted to the transcription of human speech in the Kazakh language in dynamically changing conditions. It discusses key aspects related to the phonetic structure of the Kazakh language, technical considerations in collecting the transcribe... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 7 Num: 0 Par: 3 Año: 2023

Influence of Test Room Acoustics on Non-Native Listeners? Standardized Test Performance

Acceso

en línea

Makito Kawata, Mariko Tsuruta-Hamamura and Hiroshi Hasegawa

Understanding the impact of room acoustics on non-native listeners is crucial, particularly in standardized English as a foreign language (EFL) proficiency testing environments. This study aims to elucidate how acoustics influence test scores, considerin... ver más

Revista: Acoustics Formato: Electrónico

Tabla de contenido: Vol: 5 Num: 0 Par: 4 Año: 2023

Non-Special Loudspeakers as Speech Test Sources in Natural Acoustics Speech Intelligibility Investigations

Acceso

en línea

Luis Gomez-Agustina, Haydar Aygun and Liji Suseela Thankom Mohan

Objective speech intelligibility estimations undertaken in natural acoustics speech communications (NAS) scenarios require the utilization of a speech source that approximates the acoustic characteristics of a human talker. Only a limited number of speci... ver más

Revista: Acoustics Formato: Electrónico

Tabla de contenido: Vol: 5 Num: 0 Par: 3 Año: 2023

Speech Enhancement Based on Two-Stage Processing with Deep Neural Network for Laser Doppler Vibrometer

Acceso

en línea

Chengkai Cai, Kenta Iwai and Takanobu Nishiura

The development of distant-talk measurement systems has been attracting attention since they can be applied to many situations such as security and disaster relief. One such system that uses a device called a laser Doppler vibrometer (LDV) to acquire sou... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 3 Año: 2023

Semisupervised Speech Data Extraction from Basque Parliament Sessions and Validation on Fully Bilingual Basque?Spanish ASR

Acceso

en línea

Mikel Penagarikano, Amparo Varona, Germán Bordel and Luis Javier Rodriguez-Fuentes

In this paper, a semisupervised speech data extraction method is presented and applied to create a new dataset designed for the development of fully bilingual Automatic Speech Recognition (ASR) systems for Basque and Spanish. The dataset is drawn from an... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 14 Año: 2023

High-Quality Data from Crowdsourcing towards the Creation of a Mexican Anti-Immigrant Speech Corpus

Acceso

en línea

Alejandro Molina-Villegas, Thomas Cattin, Karina Gazca-Hernandez and Edwin Aldana-Bobadilla

Currently, a significant portion of published research on online hate speech relies on existing textual corpora. However, when examining a specific context, there is a lack of preexisting datasets that include the particularities associated with various ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 14 Año: 2023

ODIN112?AI-Assisted Emergency Services in Romania

Acceso

en línea

Dan Ungureanu, Stefan-Adrian Toma, Ion-Dorinel Filip, Bogdan-Costel Mocanu, Iulian Aciobani?ei, Bogdan Marghescu, Titus Balan, Mihai Dascalu, Ion Bica and Florin Pop

The evolution of Natural Language Processing technologies transformed them into viable choices for various accessibility features and for facilitating interactions between humans and computers. A subset of them consists of speech processing systems, such... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 1 Año: 2023

Semi-Supervised Learning for Robust Emotional Speech Synthesis with Limited Data

Acceso

en línea

Jialin Zhang, Mairidan Wushouer, Gulanbaier Tuerhong and Hanfang Wang

Emotional speech synthesis is an important branch of human?computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text. With the rapid development of speech synthesis technology based on ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 13 Num: 0 Par: 9 Año: 2023

« Anterior Página: 1 de 23 Siguiente »