MCM-GPU: Multi-Chip-Module GPUs for Continued Performance Scalability A Arunkumar, E Bolotin, B Cho, U Milic, E Ebrahimi, O Villa, A Jaleel, ... (ISCA) Proceedings of the 44th Annual International Symposium on Computer …, 2017 | 231 | 2017 |
CAWA: Coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads SY Lee, A Arunkumar, CJ Wu ACM SIGARCH Computer Architecture News 43 (3S), 515-527, 2015 | 114 | 2015 |
Beyond the socket: NUMA-aware GPUs U Milic, O Villa, E Bolotin, A Arunkumar, E Ebrahimi, A Jaleel, A Ramirez, ... Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017 | 89 | 2017 |
Understanding the future of energy efficiency in multi-module gpus A Arunkumar, E Bolotin, D Nellans, CJ Wu 2019 IEEE International Symposium on High Performance Computer Architecture …, 2019 | 42 | 2019 |
Dora: Optimizing smartphone energy efficiency and web browser performance under interference D Shingari, A Arunkumar, B Gaudette, S Vrudhula, CJ Wu 2018 IEEE International Symposium on Performance Analysis of Systems and …, 2018 | 26 | 2018 |
Characterization and throttling-based mitigation of memory interference for heterogeneous smartphones D Shingari, A Arunkumar, CJ Wu 2015 IEEE International Symposium on Workload Characterization, 22-33, 2015 | 25 | 2015 |
Latte-cc: Latency tolerance aware adaptive cache compression management for energy efficient gpus A Arunkumar, SY Lee, V Soundararajan, CJ Wu 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 24 | 2018 |
ID-cache: instruction and memory divergence based cache management for GPUs A Arunkumar, SY Lee, CJ Wu 2016 IEEE international symposium on workload characterization (IISWC), 1-10, 2016 | 16 | 2016 |
E-ECC: Low Power Erasure and Error Correction Schemes for Increasing Reliability of Commodity DRAM Systems HM Chen, A Arunkumar, CJ Wu, T Mudge, C Chakrabarti | 15 | 2015 |
Investigation of ensemble features of self-supervised pretrained models for automatic speech recognition A Arunkumar, VN Sukhadia, S Umesh arXiv preprint arXiv:2206.05518, 2022 | 11 | 2022 |
Using low cost erasure and error correction schemes to improve reliability of commodity DRAM systems HM Chen, S Jeloka, A Arunkumar, D Blaauw, CJ Wu, T Mudge, ... IEEE Transactions on Computers 65 (12), 3766-3779, 2016 | 11 | 2016 |
Joint Encoder-Decoder Self-Supervised Pre-training for ASR. A Arunkumar, S Umesh Interspeech, 3418-3422, 2022 | 6 | 2022 |
ReMAP: Reuse and memory access cost aware eviction policy for last level cache management A Arunkumar, CJ Wu 2014 IEEE 32nd International Conference on Computer Design (ICCD), 110-117, 2014 | 6 | 2014 |
Estimating correlation for a real-time measure of connectivity A Arunkumar, A Panday, B Joshi, A Ravindran, HP Zaveri 2012 Annual International Conference of the IEEE Engineering in Medicine and …, 2012 | 5 | 2012 |
Snoop filter with stored replacement information, method for same, and system including victim exclusive cache and snoop filter shared replacement policies EC Quinnell, KC Heuer, T Nakra, A Arunkumar US Patent 10,360,158, 2019 | 2 | 2019 |
Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference M Adnan, A Arunkumar, G Jain, PJ Nair, I Soloveychik, P Kamath arXiv preprint arXiv:2403.09054, 2024 | 1 | 2024 |
Selective speculative prefetch requests for a last-level cache T Nakra, A Arunkumar, P Moyer, J Fleischman US Patent App. 17/564,141, 2023 | 1 | 2023 |
Relaxed invalidation for cache coherence A Arunkumar, T Nakra, MV Kazakov, MN Nemlekar US Patent 11,960,399, 2024 | | 2024 |
Region pattern-matching hardware prefetcher GH Loh, M Scrbak, A Arunkumar, J Kalamatianos US Patent App. 17/957,795, 2024 | | 2024 |
Apparatus, system, and method for throttling prefetchers to prevent training on irregular memory accesses J Kalamatianos, M Scrbak, GH Loh, A Arunkumar US Patent App. 17/957,358, 2024 | | 2024 |