Deep learning recommendation model for personalization and recommendation systems M Naumov, D Mudigere, HJM Shi, J Huang, N Sundaraman, J Park, ... arXiv preprint arXiv:1906.00091, 2019 | 502 | 2019 |
Machine learning at facebook: Understanding inference at the edge CJ Wu, D Brooks, K Chen, D Chen, S Choudhury, M Dukhan, ... 2019 IEEE international symposium on high performance computer architecture …, 2019 | 463 | 2019 |
Mlperf inference benchmark VJ Reddi, C Cheng, D Kanter, P Mattson, G Schmuelling, CJ Wu, ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020 | 354 | 2020 |
SHiP: Signature-based hit predictor for high performance caching CJ Wu, A Jaleel, W Hasenplaugh, M Martonosi, SC Steely Jr, J Emer Proceedings of the 44th Annual IEEE/ACM International Symposium on …, 2011 | 317 | 2011 |
Mlperf training benchmark P Mattson, C Cheng, G Diamos, C Coleman, P Micikevicius, D Patterson, ... Proceedings of Machine Learning and Systems 2, 336-349, 2020 | 277 | 2020 |
The architectural implications of facebook's dnn-based personalized recommendation U Gupta, CJ Wu, X Wang, M Naumov, B Reagen, D Brooks, B Cottel, ... 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020 | 246 | 2020 |
MCM-GPU: Multi-chip-module GPUs for continued performance scalability A Arunkumar, E Bolotin, B Cho, U Milic, E Ebrahimi, O Villa, A Jaleel, ... ACM SIGARCH Computer Architecture News 45 (2), 320-332, 2017 | 209 | 2017 |
Chasing carbon: The elusive environmental footprint of computing U Gupta, YG Kim, S Lee, J Tse, HHS Lee, GY Wei, D Brooks, CJ Wu 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 156 | 2021 |
Recnmp: Accelerating personalized recommendation with near-memory processing L Ke, U Gupta, BY Cho, D Brooks, V Chandra, U Diril, A Firoozshahian, ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020 | 151 | 2020 |
PACMan: prefetch-aware cache management for high performance caching CJ Wu, A Jaleel, M Martonosi, SC Steely Jr, J Emer Proceedings of the 44th Annual IEEE/ACM International Symposium on …, 2011 | 151 | 2011 |
Deeprecsys: A system for optimizing end-to-end at-scale neural recommendation inference U Gupta, S Hsia, V Saraph, X Wang, B Reagen, GY Wei, HHS Lee, ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020 | 143 | 2020 |
MLPerf: An industry standard benchmark suite for machine learning performance P Mattson, VJ Reddi, C Cheng, C Coleman, G Diamos, D Kanter, ... IEEE Micro 40 (2), 8-16, 2020 | 136 | 2020 |
Sustainable ai: Environmental implications, challenges and opportunities CJ Wu, R Raghavendra, U Gupta, B Acun, N Ardalani, K Maeng, G Chang, ... Proceedings of Machine Learning and Systems 4, 795-813, 2022 | 131 | 2022 |
CAWA: Coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads SY Lee, A Arunkumar, CJ Wu ACM SIGARCH Computer Architecture News 43 (3S), 515-527, 2015 | 107 | 2015 |
CAWS: Criticality-aware warp scheduling for GPGPU workloads SY Lee, CJ Wu Proceedings of the 23rd international conference on Parallel architectures …, 2014 | 94 | 2014 |
Quantifying the Energy Cost of Data Movement for Emerging Smart Phone Workloads on Mobile Platforms D Pandiyan, CJ Wu Workload Characterization (IISWC), 2014 IEEE International Symposium on, 2014 | 85 | 2014 |
Performance, energy characterizations and architectural implications of an emerging mobile platform benchmark suite-mobilebench D Pandiyan, SY Lee, CJ Wu 2013 IEEE International Symposium on Workload Characterization (IISWC), 133-142, 2013 | 85 | 2013 |
A Study of Mobile Device Utilization C Gao, A Gutierrez, M Rajan, RG Dreslinski, T Mudge, CJ Wu | 72 | 2015 |
Characterization and dynamic mitigation of intra-application cache interference CJ Wu, M Martonosi (IEEE ISPASS) IEEE International Symposium on Performance Analysis of …, 2011 | 71 | 2011 |
Understanding training efficiency of deep learning recommendation models at scale B Acun, M Murphy, X Wang, J Nie, CJ Wu, K Hazelwood 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 67 | 2021 |