HuBERT: Self-supervised speech representation learning by masked prediction of hidden units WN Hsu, B Bolte, YHH Tsai, K Lakhotia, R Salakhutdinov, A Mohamed IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3451-3460, 2021 | 755 | 2021 |
Multimodal transformer for unaligned multimodal language sequences YHH Tsai, S Bai, PP Liang, JZ Kolter, LP Morency, R Salakhutdinov Proceedings of the conference. Association for Computational Linguistics …, 2019 | 664 | 2019 |
Learning factorized multimodal representations YHH Tsai, PP Liang, A Zadeh, LP Morency, R Salakhutdinov arXiv preprint arXiv:1806.06176, 2018 | 263 | 2018 |
Learning cross-domain landmarks for heterogeneous domain adaptation YHH Tsai, YR Yeh, YCF Wang Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 183 | 2016 |
Learning robust visual-semantic embeddings YH Hubert Tsai, LK Huang, R Salakhutdinov Proceedings of the IEEE International conference on Computer Vision, 3571-3580, 2017 | 172 | 2017 |
Transformer Dissection: A Unified Understanding of Transformer's Attention via the Lens of Kernel YHH Tsai, S Bai, M Yamada, LP Morency, R Salakhutdinov arXiv preprint arXiv:1908.11775, 2019 | 124 | 2019 |
Self-supervised learning from a multi-view perspective YHH Tsai, Y Wu, R Salakhutdinov, LP Morency arXiv preprint arXiv:2006.05576, 2020 | 113 | 2020 |
Unsupervised domain adaptation with label and structural consistency CA Hou, YHH Tsai, YR Yeh, YCF Wang IEEE Transactions on Image Processing 25 (12), 5552-5562, 2016 | 106 | 2016 |
HuBERT: How much can a bad teacher benefit ASR pre-training? WN Hsu, YHH Tsai, B Bolte, R Salakhutdinov, A Mohamed ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 103 | 2021 |
Video relationship reasoning using gated spatio-temporal energy graph YHH Tsai, S Divvala, LP Morency, R Salakhutdinov, A Farhadi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 90 | 2019 |
Capsules with inverted dot-product attention routing YHH Tsai, N Srivastava, H Goh, R Salakhutdinov arXiv preprint arXiv:2002.04764, 2020 | 69 | 2020 |
Unsupervised domain adaptation with imbalanced cross-domain data TMH Hsu, WY Chen, CA Hou, YHH Tsai, YR Yeh, YCF Wang Proceedings of the IEEE International Conference on Computer Vision, 4121-4129, 2015 | 69 | 2015 |
Transfer neural trees for heterogeneous domain adaptation WY Chen, TMH Hsu, YHH Tsai, YCF Wang, MS Chen Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016 | 59 | 2016 |
Learning representations from imperfect time series data via tensor rank regularization PP Liang, Z Liu, YHH Tsai, Q Zhao, R Salakhutdinov, LP Morency arXiv preprint arXiv:1907.01011, 2019 | 54 | 2019 |
Improving one-shot learning through fusing side information YHH Tsai, R Salakhutdinov arXiv preprint arXiv:1710.08347, 2017 | 52 | 2017 |
Multimodal routing: Improving local and global interpretability of multimodal language analysis YHH Tsai, MQ Ma, M Yang, R Salakhutdinov, LP Morency Proceedings of the Conference on Empirical Methods in Natural Language …, 2020 | 48 | 2020 |
Strong and simple baselines for multimodal utterance embeddings PP Liang, YC Lim, YHH Tsai, R Salakhutdinov, LP Morency arXiv preprint arXiv:1906.02125, 2019 | 31 | 2019 |
Self-supervised representation learning with relative predictive coding YHH Tsai, MQ Ma, M Yang, H Zhao, LP Morency, R Salakhutdinov arXiv preprint arXiv:2103.11275, 2021 | 25 | 2021 |
Recognizing heterogeneous cross-domain data via generalized joint distribution adaptation YT Hsieh, SY Tao, YHH Tsai, YR Yeh, YCF Wang 2016 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2016 | 25 | 2016 |