DiffWave: A versatile diffusion model for audio synthesis Z Kong, W Ping, J Huang, K Zhao, B Catanzaro ICLR 2021, 2020 | 1358 | 2020 |
Deep Voice 3: Scaling text-to-speech with convolutional sequence learning W Ping, K Peng, A Gibiansky, SO Arik, A Kannan, S Narang, J Raiman, ... ICLR 2018, 2017 | 904* | 2017 |
Deep Voice 2: Multi-speaker neural text-to-speech α-β, S Arik, G Diamos, A Gibiansky, J Miller, K Peng, W Ping, J Raiman, ... NeurIPS 2017, 2017 | 658* | 2017 |
Neural voice cloning with a few samples S Arik*, J Chen*, K Peng*, W Ping*, Y Zhou NeurIPS, 2018 | 464 | 2018 |
ClariNet: Parallel wave generation in end-to-end text-to-speech W Ping, K Peng, J Chen ICLR 2019, 2018 | 424 | 2018 |
BigVGAN: A universal neural vocoder with large-scale training S Lee, W Ping, B Ginsburg, B Catanzaro, S Yoon ICLR 2023, 2022 | 200 | 2022 |
VILA: On pre-training for visual language models J Lin, H Yin, W Ping, P Molchanov, M Shoeybi, S Han CVPR 2024, 2023 | 187 | 2023 |
On fast sampling of diffusion probabilistic models Z Kong, W Ping ICML 2021 Workshop on Invertible Neural Networks, Normalizing Flows, and …, 2021 | 184 | 2021 |
Non-autoregressive neural text-to-speech K Peng, W Ping, Z Song, K Zhao ICML 2020, 2019 | 169* | 2019 |
Factuality enhanced language models for open-ended text generation N Lee, W Ping, P Xu, M Patwary, M Shoeybi, B Catanzaro NeurIPS 2022, 2022 | 165 | 2022 |
WaveFlow: A compact flow-based model for raw audio W Ping, K Peng, K Zhao, Z Song ICML 2020, 2020 | 150 | 2020 |
Long-short transformer: Efficient transformers for language and vision C Zhu, W Ping, C Xiao, M Shoeybi, T Goldstein, A Anandkumar, ... NeurIPS 2021, 2021 | 137 | 2021 |
Cancer metastasis detection with neural conditional random field Y Li, W Ping Medical Imaging with Deep Learning, 2018 | 133 | 2018 |
Retrieval meets long context large language models P Xu, W Ping, X Wu, L McAfee, C Zhu, Z Liu, S Subramanian, ... ICLR 2024, 2023 | 110 | 2023 |
End-to-end training of neural retrievers for open-domain question answering DS Sachan, M Patwary, M Shoeybi, N Kant, W Ping, WL Hamilton, ... ACL 2021, 2021 | 98 | 2021 |
Topic compositional neural language model W Wang, Z Gan, W Wang, D Shen, J Huang, W Ping, S Satheesh, L Carin AISTATS 2018, 2017 | 93 | 2017 |
One TTS alignment to rule them all R Badlani, A Łancucki, KJ Shih, R Valle, W Ping, B Catanzaro ICASSP 2022, 2021 | 86 | 2021 |
Systems and methods for multi-speaker neural text-to-speech G Diamos, A Gibiansky, J Miller, K Peng, W Ping, J Raiman, Y Zhou US Patent 10,896,669, 2021 | 85 | 2021 |
Speech denoising in the waveform domain with self-attention Z Kong, W Ping, A Dantrey, B Catanzaro ICASSP 2022, 2022 | 70 | 2022 |
Exploring the limits of domain-adaptive training for detoxifying large-scale language models B Wang, W Ping, C Xiao, P Xu, M Patwary, M Shoeybi, B Li, ... NeurIPS 2022, 2022 | 61 | 2022 |