Audio Processing

All posts under tag "Audio Processing"

9 posts total
Sorted by date
SA-SSL-MOS: Self-supervised Learning MOS Prediction with Spectral Augmentation for Generalized Multi-Rate Speech Assessment

SA-SSL-MOS: Self-supervised Learning MOS Prediction with Spectral Augmentation for Generalized Multi-Rate Speech Assessment

Designing a speech quality assessment (SQA) system for estimating mean-opinion-score (MOS) of multi-rate speech with varying sampling frequency (16-48 kHz) is a challenging task. The challenge arises due to the limited availability of a MOS-labeled training dataset comprising multi-rate speech sampl

Audio Processing Learning Electrical Engineering and Systems Science
Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios

Target speech extraction (TSE) typically relies on pre-recorded high-quality enrollment speech, which disrupts user experience and limits feasibility in spontaneous interaction. In this paper, we propose Enroll-on-Wakeup (EoW), a novel framework where the wake-word segment, captured naturally during

Audio Processing Electrical Engineering and Systems Science

< Category Statistics (Total: 5005) >

General Relativity
59
General Research
699
HEP-EX
14
HEP-LAT
8
HEP-PH
63
HEP-TH
68
MATH-PH
82
NUCL-EX
5
NUCL-TH
15
Quantum Physics
57

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut