A Multi-task Learning Framework for Emotion Recognition Using 2D Continuous Space R Xia, Y Liu Transaction of Affective Computing, 2015 | 215 | 2015 |
Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition J Deng, R Xia, Z Zhang, Y Liu, B Schuller 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 97 | 2014 |
Using i-Vector Space Model for Emotion Recognition. R Xia, Y Liu Interspeech, 2230-2233, 2012 | 87 | 2012 |
Modeling gender information for emotion recognition using denoising autoencoder R Xia, J Deng, B Schuller, Y Liu 2014 IEEE International conference on acoustics, speech and signal …, 2014 | 61 | 2014 |
Sentence level emotion recognition based on decisions from subsentence segments JH Jeon, R Xia, Y Liu 2011 IEEE international conference on acoustics, speech and signal …, 2011 | 57 | 2011 |
Multimodal relational tensor network for sentiment and emotion classification S Sahay, SH Kumar, R Xia, J Huang, L Nachman arXiv preprint arXiv:1806.02923, 2018 | 47 | 2018 |
Using denoising autoencoder for emotion recognition. R Xia, Y Liu Interspeech, 2886-2889, 2013 | 44 | 2013 |
Efficient neural music generation MWY Lam, Q Tian, T Li, Z Yin, S Feng, M Tu, Y Ji, R Xia, M Ma, X Song, ... Advances in Neural Information Processing Systems 36, 2024 | 38 | 2024 |
DBN-Ivector Framework for Acoustic Emotion Recognition R Xia, Y Liu INTERSPEECH, 480-484, 2016 | 31 | 2016 |
Separate anything you describe X Liu, Q Kong, Y Zhao, H Liu, Y Yuan, Y Liu, R Xia, Y Wang, MD Plumbley, ... arXiv preprint arXiv:2308.05037, 2023 | 30 | 2023 |
Leveraging valence and activation information via multi-task learning for categorical emotion recognition R Xia, Y Liu Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International …, 2015 | 21 | 2015 |
Level of interest sensing in spoken dialog using multi-level fusion of acoustic and lexical evidence JH Jeon, R Xia, Y Liu Eleventh Annual Conference of the International Speech Communication Association, 2010 | 20 | 2010 |
Pretraining conformer with asr for speaker verification D Cai, W Wang, M Li, R Xia, C Huang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 17 | 2023 |
A preliminary study of cross-lingual emotion recognition from speech: automatic classification versus human perception. JH Jeon, D Le, R Xia, Y Liu Interspeech, 2837-2840, 2013 | 17 | 2013 |
Speech enhancement with weakly labelled data from audioset Q Kong, H Liu, X Du, L Chen, R Xia, Y Wang arXiv preprint arXiv:2102.09971, 2021 | 16 | 2021 |
Noise robust tts for low resource speakers using pre-trained model and speech enhancement D Dai, L Chen, Y Wang, M Wang, R Xia, X Song, Z Wu, Y Wang arXiv preprint arXiv:2005.12531, 2020 | 11 | 2020 |
Cloning one’s voice using very limited data in the wild D Dai, Y Chen, L Chen, M Tu, L Liu, R Xia, Q Tian, Y Wang, Y Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
Level of interest sensing in spoken dialog using decision-level fusion of acoustic and lexical evidence JH Jeon, R Xia, Y Liu Computer Speech & Language 28 (2), 420-433, 2014 | 8 | 2014 |
Audio Prompt Tuning for Universal Sound Separation Y Liu, X Liu, Y Zhao, Y Wang, R Xia, P Tain, Y Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Seed-asr: Understanding diverse speech and contexts with llm-based speech recognition Y Bai, J Chen, J Chen, W Chen, Z Chen, C Ding, L Dong, Q Dong, Y Du, ... arXiv preprint arXiv:2407.04675, 2024 | 3 | 2024 |