Electrical Engineering

5301 papers in Electrical Engineering

Explore Subcategories

Eess Sp

73 Papers

TITLE

DATE

VIEWS

CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments

By Nelson Yalta, Shinji Watanabe, Takaaki Hori · ArXiv: 1811.02735

2019-06-24

Seismic Signal Denoising and Decomposition Using Deep Neural Networks

By Weiqiang Zhu, S. Mostafa Mousavi, Gregory C. Beroza · ArXiv: 1811.02695

2020-01-08

Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach

By Ran Wang, Yao Wang, Adeen Flinker · ArXiv: 1811.02694

2018-11-09

SDR - half-baked or well done?

By Jonathan Le Roux, Scott Wisdom, Hakan Erdogan · ArXiv: 1811.02508

2018-11-07

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

By Giovanni Morrone, Luca Pasa, Vadim Tikhanoff · ArXiv: 1811.02480

2021-02-04

User Specific Adaptation in Automatic Transcription of Vocalised Percussion

By Antonio Ramires, Rui Penha, Matthew E. P. Davies · ArXiv: 1811.02406

2018-11-07

An amplitudes-perturbation data augmentation method in convolutional neural networks for EEG decoding

By Xian-Rui Zhang, Meng-Ying Lei, Yang Li · ArXiv: 1811.02353

2018-11-07

NIPS4Bplus: a richly annotated birdsong audio dataset

By Veronica Morfi, Yves Bas, Hanna Pamu{l}a · ArXiv: 1811.02275

2018-11-15

Language model integration based on memory control for sequence to sequence speech recognition

By Jaejin Cho, Shinji Watanabe, Takaaki Hori · ArXiv: 1811.02162

2025-03-05

Kernel Machines Beat Deep Neural Networks on Mask-based Single-channel Speech Enhancement

By Like Hui, Siyuan Ma, Mikhail Belkin · ArXiv: 1811.02095

2018-11-07

How to Improve Your Speaker Embeddings Extractor in Generic Toolkits

By Hossein Zeinali, Lukas Burget, Johan Rohdin · ArXiv: 1811.02066

2018-11-07

End-to-End Monaural Multi-speaker ASR System without Pretraining

By Xuankai Chang, Yanmin Qian, Kai Yu · ArXiv: 1811.02062

2018-11-07

Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation

By Ye Jia, Melvin Johnson, Wolfgang Macherey · ArXiv: 1811.02050

2019-02-12

End-to-End Sound Source Separation Conditioned On Instrument Labels

By Olga Slizovskaia, Leo Kim, Gloria Haro · ArXiv: 1811.01850

2019-05-10

Cycle-consistency training for end-to-end speech recognition

By Takaaki Hori, Ramon Astudillo, Tomoki Hayashi · ArXiv: 1811.01690

2019-05-24

ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion

By Hirokazu Kameoka, Kou Tanaka, Damian Kwasny · ArXiv: 1811.01609

2020-10-08

Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information

By Efthymios Tzinis, Shrikant Venkataramani, Paris Smaragdis · ArXiv: 1811.01531

2021-05-14

Communication Through Breath: Aerosol Transmission

By Maryam Khalid, Osama Amin, Sajid Ahmed · ArXiv: 1811.01393

2018-11-06

Investigating context features hidden in End-to-End TTS

By Kohki Mametani, Tsuneo Kato, Seiichi Yamamoto · ArXiv: 1811.01376

2019-02-26

Towards Unsupervised Speech-to-Text Translation

By Yu-An Chung, Wei-Hung Weng, Schrasing Tong · ArXiv: 1811.01307

2018-11-06

Deep Ad-hoc Beamforming

By Xiao-Lei Zhang · ArXiv: 1811.01233

2021-02-10

Multitask learning for frame-level instrument recognition

By Yun-Ning Hung, Yi-An Chen, Yi-Hsuan Yang · ArXiv: 1811.01143

2019-02-19

Deep Segment Attentive Embedding for Duration Robust Speaker Verification

By Bin Liu, Shuai Nie, Yaping Zhang · ArXiv: 1811.00883

2018-11-05

Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks

By Emad M. Grais, Hagen Wierstorf, Dominic Ward · ArXiv: 1811.00454

2019-06-25

Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models

By Herman Kamper · ArXiv: 1811.00403

2019-04-16

End-to-end Models with auditory attention in Multi-channel Keyword Spotting

By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00350

2018-11-06

Sequence-to-sequence Models for Small-Footprint Keyword Spotting

By Haitong Zhang, Junbo Zhang, Yujun Wang · ArXiv: 1811.00348

2018-11-02

Neural Music Synthesis for Flexible Timbre Control

By Jong Wook Kim, Rachel Bittner, Aparna Kumar · ArXiv: 1811.00223

2018-11-02

Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition

By David B. Ramsay, Kevin Kilgour, Dominik Roblek · ArXiv: 1811.00006

2018-11-02

WaveGlow: A Flow-based Generative Network for Speech Synthesis

By Ryan Prenger, Rafael Valle, Bryan Catanzaro · ArXiv: 1811.00002

2018-11-02

«« « 125 126 127 128 129 130 131 132 133 134 » »»