We gratefully acknowledge support from
the Simons Foundation
and member institutions

Sound

Authors and titles for recent submissions

[ total of 8 entries: 1-8 ]
[ showing up to 25 entries per page: fewer | more ]

Fri, 18 May 2018

[1]  arXiv:1805.06572 [pdf, ps, other]
Title: FastFCA: A Joint Diagonalization Based Fast Algorithm for Audio Source Separation Using A Full-Rank Spatial Covariance Model
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)

Thu, 17 May 2018

[2]  arXiv:1805.06234 [pdf, other]
Title: PSD Estimation and Source Separation in a Noisy Reverberant Environment using a Spherical Microphone Array
Subjects: Sound (cs.SD)
[3]  arXiv:1805.06239 (cross-list from eess.AS) [pdf, other]
Title: A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)

Wed, 16 May 2018

[4]  arXiv:1805.05826 [pdf, other]
Title: A Purely End-to-end System for Multi-speaker Speech Recognition
Comments: ACL 2018
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[5]  arXiv:1805.05324 [pdf, other]
Title: Extended pipeline for content-based feature engineering in music genre recognition
Authors: Tina Raissi (1), Alessandro Tibo (2), Paolo Bientinesi (1), ((1) RWTH Aachen University, (2) University of Florence)
Comments: ICASSP 2018
Subjects: Sound (cs.SD); Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[6]  arXiv:1805.05574 (cross-list from cs.CL) [pdf, other]
Title: Improved ASR for Under-Resourced Languages Through Multi-Task Learning with Acoustic Landmarks
Comments: Submitted in Interspeech2018
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Tue, 15 May 2018

[7]  arXiv:1805.04792 (cross-list from cs.GR) [pdf, other]
Title: Scene-Aware Audio for 360\textdegree{} Videos
Comments: SIGGRAPH 2018, Technical Papers, 12 pages, 17 figures, this http URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Fri, 11 May 2018

[8]  arXiv:1805.03647 [pdf, other]
Title: End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input
Comments: accepted to IJCNN 2018
Subjects: Sound (cs.SD); Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[ total of 8 entries: 1-8 ]
[ showing up to 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)