Electrical Engineering and Systems Science / Audio Processing

All posts under category "Electrical Engineering and Systems Science / Audio Processing"

13 posts total
Sorted by date
SA-SSL-MOS: Self-supervised Learning MOS Prediction with Spectral Augmentation for Generalized Multi-Rate Speech Assessment

SA-SSL-MOS: Self-supervised Learning MOS Prediction with Spectral Augmentation for Generalized Multi-Rate Speech Assessment

Designing a speech quality assessment (SQA) system for estimating mean-opinion-score (MOS) of multi-rate speech with varying sampling frequency (16-48 kHz) is a challenging task. The challenge arises due to the limited availability of a MOS-labeled training dataset comprising multi-rate speech sampl

Learning Electrical Engineering and Systems Science Audio Processing
Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Target speech extraction (TSE) typically relies on pre-recorded high-quality enrollment speech, which disrupts user experience and limits feasibility in spontaneous interaction. In this paper, we propose Enroll-on-Wakeup (EoW), a novel framework where the wake-word segment, captured naturally during

Electrical Engineering and Systems Science Audio Processing
A Robust Target Linearly Constrained Minimum Variance Beamformer With   Spatial Cues Preservation for Binaural Hearing Aids

A Robust Target Linearly Constrained Minimum Variance Beamformer With Spatial Cues Preservation for Binaural Hearing Aids

In this paper, a binaural beamforming algorithm for hearing aid applications is introduced.The beamforming algorithm is designed to be robust to some error in the estimate of the target speaker direction. The algorithm has two main components: a robust target linearly constrained minimum variance (T

Computer Science Sound Electrical Engineering and Systems Science Audio Processing
Interpreting DNN output layer activations: A strategy to cope with   unseen data in speech recognition

Interpreting DNN output layer activations: A strategy to cope with unseen data in speech recognition

Unseen data can degrade performance of deep neural net acoustic models. To cope with unseen data, adaptation techniques are deployed. For unlabeled unseen data, one must generate some hypothesis given an existing model, which is used as the label for model adaptation. However, assessing the goodness

NLP Electrical Engineering and Systems Science Data Computer Science Sound Audio Processing
Masked Conditional Neural Networks for Automatic Sound Events   Recognition

Masked Conditional Neural Networks for Automatic Sound Events Recognition

Deep neural network architectures designed for application domains other than sound, especially image recognition, may not optimally harness the time-frequency representation when adapted to the sound recognition problem. In this work, we explore the ConditionaL Neural Network (CLNN) and the Masked

Statistics Electrical Engineering and Systems Science Network Machine Learning Computer Science Sound Audio Processing
Polyphonic audio tagging with sequentially labelled data using CRNN with   learnable gated linear units

Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units

Audio tagging aims to detect the types of sound events occurring in an audio recording. To tag the polyphonic audio recordings, we propose to use Connectionist Temporal Classification (CTC) loss function on the top of Convolutional Recurrent Neural Network (CRNN) with learnable Gated Linear Units (G

Data Electrical Engineering and Systems Science Computer Science Sound Audio Processing

< Category Statistics (Total: 5017) >

General Relativity
66
General Research
781
HEP-EX
25
HEP-LAT
7
HEP-PH
79
HEP-TH
67
MATH-PH
79
NUCL-EX
6
NUCL-TH
14
Quantum Physics
67

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut