We gratefully acknowledge support from
the Simons Foundation
and member institutions

Sound

Authors and titles for recent submissions

[ total of 15 entries: 1-15 ]
[ showing up to 25 entries per page: fewer | more ]

Fri, 17 Nov 2017

[1]  arXiv:1711.05747 [pdf, other]
Title: Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
Subjects: Sound (cs.SD); Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[2]  arXiv:1711.05734 (cross-list from cs.DC) [pdf, other]
Title: Chipmunk: A Systolically Scalable 0.9 mm${}^2$, 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD)

Thu, 16 Nov 2017

[3]  arXiv:1711.05447 [pdf, other]
Title: Emotional End-to-End Neural Speech Synthesizer
Comments: 5 pages, 3 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[4]  arXiv:1711.05443 [pdf, other]
Title: Human and Machine Speaker Recognition Based on Short Trivial Events
Comments: Submitted to ICASSP 2018
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
[5]  arXiv:1711.05260 [pdf, other]
Title: Optimal Tuning of Two-Dimensional Keyboards
Comments: 14 page, 3 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[6]  arXiv:1711.05551 (cross-list from eess.AS) [pdf, other]
Title: Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results
Authors: Grégoire Lafay (1), Emmanouil Benetos (2), Mathieu Lagrange (3) ((1) IRCCyN, (2) QMUL, (3) LS2N)
Journal-ref: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017), Sep 2017, Mohonk, United States
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Machine Learning (stat.ML)
[7]  arXiv:1711.05355 (cross-list from eess.AS) [pdf, other]
Title: Automatic Conflict Detection in Police Body-Worn Video
Comments: 5 pages, 2 figures, 1 table
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Machine Learning (stat.ML)

Wed, 15 Nov 2017

[8]  arXiv:1711.04480 [pdf, other]
Title: Audio-to-score alignment of piano music using RNN-based automatic music transcription
Comments: 6 pages, 5 figures, The paper was published in SMC 2017 proceedings, Proceedings of 14th Sound and Music Computing Conference (SMC). 2017
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[9]  arXiv:1711.04121 [pdf, other]
Title: Unsupervised Audio Source Separation via Spectrum Energy Preserved Wasserstein Learning
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[10]  arXiv:1711.04845 (cross-list from stat.ML) [pdf, other]
Title: Invariances and Data Augmentation for Supervised Music Transcription
Comments: 6 pages
Subjects: Machine Learning (stat.ML); Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[11]  arXiv:1711.04351 (cross-list from eess.AS) [pdf, other]
Title: Automatic detection of alarm sounds in a noisy hospital environment using model and non-model based approaches
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[12]  arXiv:1711.04347 (cross-list from eess.AS) [pdf]
Title: Deep Networks tag the location of bird vocalisations on audio spectrograms
Comments: arXiv admin note: substantial text overlap with arXiv:1609.08408
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Sound (cs.SD)
[13]  arXiv:1711.04022 (cross-list from cs.LG) [pdf, other]
Title: Deep Within-Class Covariance Analysis for Acoustic Scene Classification
Comments: 5 pages, 1 figure
Subjects: Learning (cs.LG); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Fri, 10 Nov 2017

[14]  arXiv:1711.03280 (cross-list from cs.LG) [pdf, ps, other]
Title: Crafting Adversarial Examples For Speech Paralinguistics Applications
Subjects: Learning (cs.LG); Cryptography and Security (cs.CR); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)

Thu, 9 Nov 2017

[15]  arXiv:1711.03037 [pdf, other]
Title: A joint separation-classification model for sound event detection of weakly labelled data
Comments: Submitted to ICASSP 2018, source code available
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[ total of 15 entries: 1-15 ]
[ showing up to 25 entries per page: fewer | more ]

Disable MathJax (What is MathJax?)