Electrical Engineering
5301 papers in Electrical Engineering
Explore Subcategories
TITLE
DATE
VIEWS
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments
By Nelson Yalta, Shinji Watanabe, Takaaki Hori · ArXiv: 1811.02735
2019-06-24
0
Seismic Signal Denoising and Decomposition Using Deep Neural Networks
By Weiqiang Zhu, S. Mostafa Mousavi, Gregory C. Beroza · ArXiv: 1811.02695
2020-01-08
0
Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach
By Ran Wang, Yao Wang, Adeen Flinker · ArXiv: 1811.02694
2018-11-09
0
SDR - half-baked or well done?
By Jonathan Le Roux, Scott Wisdom, Hakan Erdogan · ArXiv: 1811.02508
2018-11-07
0
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
By Giovanni Morrone, Luca Pasa, Vadim Tikhanoff · ArXiv: 1811.02480
2021-02-04
0
User Specific Adaptation in Automatic Transcription of Vocalised Percussion
By Antonio Ramires, Rui Penha, Matthew E. P. Davies · ArXiv: 1811.02406
2018-11-07
0
An amplitudes-perturbation data augmentation method in convolutional neural networks for EEG decoding
By Xian-Rui Zhang, Meng-Ying Lei, Yang Li · ArXiv: 1811.02353
2018-11-07
0
NIPS4Bplus: a richly annotated birdsong audio dataset
By Veronica Morfi, Yves Bas, Hanna Pamu{l}a · ArXiv: 1811.02275
2018-11-15
0
Language model integration based on memory control for sequence to sequence speech recognition
By Jaejin Cho, Shinji Watanabe, Takaaki Hori · ArXiv: 1811.02162
2025-03-05
0
Kernel Machines Beat Deep Neural Networks on Mask-based Single-channel Speech Enhancement
By Like Hui, Siyuan Ma, Mikhail Belkin · ArXiv: 1811.02095
2018-11-07
0
How to Improve Your Speaker Embeddings Extractor in Generic Toolkits
By Hossein Zeinali, Lukas Burget, Johan Rohdin · ArXiv: 1811.02066
2018-11-07
0
End-to-End Monaural Multi-speaker ASR System without Pretraining
By Xuankai Chang, Yanmin Qian, Kai Yu · ArXiv: 1811.02062
2018-11-07
0
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
By Ye Jia, Melvin Johnson, Wolfgang Macherey · ArXiv: 1811.02050
2019-02-12
0
End-to-End Sound Source Separation Conditioned On Instrument Labels
By Olga Slizovskaia, Leo Kim, Gloria Haro · ArXiv: 1811.01850
2019-05-10
1
Cycle-consistency training for end-to-end speech recognition
By Takaaki Hori, Ramon Astudillo, Tomoki Hayashi · ArXiv: 1811.01690
2019-05-24
0
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion
By Hirokazu Kameoka, Kou Tanaka, Damian Kwasny · ArXiv: 1811.01609
2020-10-08
0
Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information
By Efthymios Tzinis, Shrikant Venkataramani, Paris Smaragdis · ArXiv: 1811.01531
2021-05-14
0
Communication Through Breath: Aerosol Transmission
By Maryam Khalid, Osama Amin, Sajid Ahmed · ArXiv: 1811.01393
2018-11-06
0
Investigating context features hidden in End-to-End TTS
By Kohki Mametani, Tsuneo Kato, Seiichi Yamamoto · ArXiv: 1811.01376
2019-02-26
0
Towards Unsupervised Speech-to-Text Translation
By Yu-An Chung, Wei-Hung Weng, Schrasing Tong · ArXiv: 1811.01307
2018-11-06
1
Multitask learning for frame-level instrument recognition
By Yun-Ning Hung, Yi-An Chen, Yi-Hsuan Yang · ArXiv: 1811.01143
2019-02-19
0
Deep Segment Attentive Embedding for Duration Robust Speaker Verification
By Bin Liu, Shuai Nie, Yaping Zhang · ArXiv: 1811.00883
2018-11-05
0
Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
By Emad M. Grais, Hagen Wierstorf, Dominic Ward · ArXiv: 1811.00454
2019-06-25
1
Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
By Herman Kamper · ArXiv: 1811.00403
2019-04-16
1
End-to-end Models with auditory attention in Multi-channel Keyword Spotting
By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00350
2018-11-06
0
Sequence-to-sequence Models for Small-Footprint Keyword Spotting
By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00348
2018-11-02
0
Neural Music Synthesis for Flexible Timbre Control
By Jong Wook Kim, Rachel Bittner, Aparna Kumar · ArXiv: 1811.00223
2018-11-02
0
Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition
By David B. Ramsay, Kevin Kilgour, Dominik Roblek · ArXiv: 1811.00006
2018-11-02
0
WaveGlow: A Flow-based Generative Network for Speech Synthesis
By Ryan Prenger, Rafael Valle, Bryan Catanzaro · ArXiv: 1811.00002
2018-11-02
0