<progress id="hpfzt"><pre id="hpfzt"></pre></progress>
        <ruby id="hpfzt"></ruby>
        <big id="hpfzt"><p id="hpfzt"></p></big>

        <progress id="hpfzt"></progress>

        <big id="hpfzt"><pre id="hpfzt"></pre></big>

        <i id="hpfzt"></i>

            <strike id="hpfzt"><video id="hpfzt"><ins id="hpfzt"></ins></video></strike>
            <dl id="hpfzt"></dl>
                Mirror operated in collaboration with local support

                Sound

                Authors and titles for recent submissions

                [ total of 65 entries: 1-25 | 26-50 | 51-65 ]
                [ showing 25 entries per page: fewer | more | all ]

                Tue, 26 May 2020

                [1]  arXiv:2005.12230 [pdf, other]
                Title: Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features
                Comments: 5 pages, 3 figures
                Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
                [2]  arXiv:2005.12195 [pdf, other]
                Title: End-to-End Auditory Object Recognition via Inception Nucleus
                Comments: Published In proceedings of ICASSP 2020
                Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
                [3]  arXiv:2005.11459 [pdf, other]
                Title: Power Pooling Operators and Confidence Learning for Semi-Supervised Sound Event Detection
                Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
                [4]  arXiv:2005.11950 (cross-list from eess.AS) [pdf]
                Title: An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling
                Comments: 5 pages, 2 figures, Submitted to INTERSPEECH 2020
                Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
                [5]  arXiv:2005.11769 (cross-list from eess.AS) [pdf, other]
                Title: Lite Audio-Visual Speech Enhancement
                Comments: Submitted to Interspeech 2020
                Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
                [6]  arXiv:2005.11682 (cross-list from eess.AS) [pdf, other]
                Title: Glottal source estimation robustness: A comparison of sensitivity of voice source estimation techniques
                Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
                [7]  arXiv:2005.11676 (cross-list from cs.CL) [pdf, other]
                Title: Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
                Comments: Submitted to INTERSPEECH 2020
                Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
                [8]  arXiv:2005.11612 (cross-list from eess.AS) [pdf, other]
                Title: Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation
                Comments: Submitted to Interspeech2020
                Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
                [9]  arXiv:2005.11611 (cross-list from eess.AS) [pdf, other]
                Title: Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks
                Comments: Submitted to Interspeech2020
                Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
                [10]  arXiv:2005.11371 (cross-list from eess.AS) [pdf, other]
                Title: Speaker diarization with session-level speaker embedding refinement using graph neural networks
                Comments: ICASSP 2020 (45th International Conference on Acoustics, Speech, and Signal Processing)
                Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)

                Mon, 25 May 2020

                [11]  arXiv:2005.10929 [pdf, other]
                Title: Large scale evaluation of importance maps in automatic speech recognition
                Comments: submitted to INTERSPEECH 2020
                Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
                [12]  arXiv:2005.11185 (cross-list from cs.CL) [pdf, other]
                Title: Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
                Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
                [13]  arXiv:2005.11184 (cross-list from cs.CL) [pdf, other]
                Title: End-to-end Named Entity Recognition from English Speech
                Comments: submitted to Interspeech-2020
                Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
                [14]  arXiv:2005.11172 (cross-list from eess.AS) [pdf, other]
                Title: Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition
                Comments: arXiv admin note: substantial text overlap with arXiv:1910.11256
                Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
                [15]  arXiv:2005.11138 (cross-list from eess.AS) [pdf, other]
                Title: TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
                Comments: First four authors contributed equally. For audio samples, see this https URL
                Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
                [16]  arXiv:2005.11129 (cross-list from eess.AS) [pdf, other]
                Title: Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
                Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
                [17]  arXiv:2005.11004 (cross-list from eess.AS) [pdf, other]
                Title: NAUTILUS: a Versatile Voice Cloning System
                Comments: Submitted to The IEEE/ACM Transactions on Audio, Speech, and Language Processing
                Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)

                Fri, 22 May 2020 (showing first 8 of 19 entries)

                [18]  arXiv:2005.10637 [pdf, other]
                Title: Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition
                Comments: 5 pages, 2 figures
                Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
                [19]  arXiv:2005.10539 [pdf, other]
                Title: An approach to Beethoven's 10th Symphony
                Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
                [20]  arXiv:2005.10480 [pdf, other]
                Title: Understanding the Importance of Heart Sound Segmentation for Heart Anomaly Detection
                Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Quantitative Methods (q-bio.QM)
                [21]  arXiv:2005.10463 [pdf, other]
                Title: Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
                Comments: Submitted to Interspeech2020
                Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
                [22]  arXiv:2005.10438 [pdf, other]
                Title: Conversational End-to-End TTS for Voice Agent
                Comments: 5 pages
                Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
                [23]  arXiv:2005.10803 (cross-list from eess.AS) [pdf, other]
                Title: Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism
                Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
                [24]  arXiv:2005.10627 (cross-list from eess.AS) [pdf, other]
                Title: Dynamic Sparsity Neural Networks for Automatic Speech Recognition
                Comments: Submitted to INTERSPEECH 2020
                Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
                [25]  arXiv:2005.10548 (cross-list from eess.AS) [pdf, other]
                Title: Coswara -- A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis
                Comments: A description of Coswara dataset to evaluate COVID-19 diagnosis using respiratory sounds
                Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
                [ total of 65 entries: 1-25 | 26-50 | 51-65 ]
                [ showing 25 entries per page: fewer | more | all ]
                ƴϷ