Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 3254 | 2018 |
Tacotron: Towards End-to-End Speech Synthesis Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 2459* | 2017 |
Tacotron: Towards end-to-end speech synthesis Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 2182 | 2017 |
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International Conference on Machine Learning, 5180-5189, 2018 | 969 | 2018 |
Towards end-to-end prosody transfer for expressive speech synthesis with tacotron RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ... international conference on machine learning, 4693-4702, 2018 | 687 | 2018 |
Tacotron: A fully end-to-end text-to-speech synthesis model Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135 164, 2017 | 287 | 2017 |
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ... arXiv preprint arXiv:1907.04448, 2019 | 192 | 2019 |
Predicting Expressive Speaking Style from Text in End-To-End Speech Synthesis D Stanton, Y Wang, RJ Skerry-Ryan 2018 IEEE Spoken Language Technology Workshop (SLT), 595-602, 2018 | 146 | 2018 |
Semi-supervised training for improving data efficiency in end-to-end speech synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 141 | 2019 |
Location-relative attention mechanisms for robust long-form speech synthesis E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 131 | 2020 |
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis RJ Weiss, RJ Skerry-Ryan, E Battenberg, S Mariooryad, DP Kingma ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 119 | 2021 |
Uncovering Latent Style Factors for Expressive Speech Synthesis Y Wang, RJ Skerry-Ryan, Y Xiao, D Stanton, J Shor, E Battenberg, ... arXiv preprint arXiv:1711.00520, 2017 | 88 | 2017 |
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021 | 68 | 2021 |
Synthesizing speech from text using neural networks Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ... US Patent 10,971,170, 2021 | 59 | 2021 |
Semi-Supervised Generative Modeling for Controllable Speech Synthesis R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ... arXiv preprint arXiv:1910.01709, 2019 | 59 | 2019 |
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ... arXiv preprint arXiv:1906.03402, 2019 | 58 | 2019 |
Organic indoor location discovery S Teller, J Battat, B Charrow, D Curtis, R Ryan, J Ledlie, J Hicks Computer Science and Artificial Intelligence Laboratory Technical Report 75, 16, 2008 | 29 | 2008 |
Non-saturating GAN training as divergence minimization M Shannon, B Poole, S Mariooryad, T Bagby, E Battenberg, D Kao, ... arXiv preprint arXiv:2010.08029, 2020 | 21 | 2020 |
Identifying entities using search results TA Lasko, A Tomkins, M Angelo, MK Gray, R Ryan, NU Godbole, ... US Patent 8,856,099, 2014 | 21 | 2014 |
Anatomy of a subway hack R Ryan, Z Anderson, A Chiesa 16th DEFCON Hacking Conference (DEFCON 2008), 2008 | 20 | 2008 |