Hetero-mark, a benchmark suite for CPU-GPU collaborative computing Y Sun, X Gong, AK Ziabari, L Yu, X Li, S Mukherjee, C McCardwell, ... 2016 IEEE International Symposium on Workload Characterization (IISWC), 1-10, 2016 | 139 | 2016 |
MGPUSim: Enabling multi-GPU performance modeling and optimization Y Sun, T Baruah, SA Mojumder, S Dong, X Gong, S Treadway, Y Bao, ... Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 113 | 2019 |
A comprehensive performance analysis of HSA and OpenCL 2.0 S Mukherjee, Y Sun, P Blinzer, AK Ziabari, D Kaeli 2016 IEEE International Symposium on Performance Analysis of Systems and …, 2016 | 54 | 2016 |
Profiling dnn workloads on a volta-based dgx-1 system SA Mojumder, MS Louis, Y Sun, AK Ziabari, JL Abellán, J Kim, D Kaeli, ... 2018 IEEE International Symposium on Workload Characterization (IISWC), 122-133, 2018 | 53 | 2018 |
Asymmetric NoC architectures for GPU systems AK Ziabari, JL Abellán, Y Ma, A Joshi, D Kaeli Proceedings of the 9th International Symposium on Networks-on-Chip, 1-8, 2015 | 52 | 2015 |
Leveraging silicon-photonic noc for designing scalable gpus AKK Ziabari, JL Abellán, R Ubal, C Chen, A Joshi, D Kaeli Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015 | 44 | 2015 |
UMH: A hardware-based unified memory hierarchy for systems with multiple discrete GPUs AK Ziabari, Y Sun, Y Ma, D Schaa, JL Abellán, R Ubal, J Kim, A Joshi, ... ACM Transactions on Architecture and Code Optimization (TACO) 13 (4), 1-25, 2016 | 39 | 2016 |
TwinKernels: an execution model to improve GPU hardware scheduling at compile time X Gong, Z Chen, AK Ziabari, R Ubal, D Kaeli 2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017 | 24 | 2017 |
REMAP: A reliability/endurance mechanism for advancing PCM MK Tavana, AK Ziabari, M Arjomand, M Kandemir, C Das, D Kaeli Proceedings of the International Symposium on Memory Systems, 385-398, 2017 | 18 | 2017 |
Quantifying the energy efficiency of FFT on heterogeneous platforms Y Ukidave, AK Ziabari, P Mistry, G Schirner, D Kaeli 2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013 | 16 | 2013 |
Analyzing power efficiency of optimization techniques and algorithm design methods for applications on heterogeneous platforms Y Ukidave, AK Ziabari, P Mistry, G Schirner, D Kaeli The International journal of high performance computing applications 28 (3 …, 2014 | 12 | 2014 |
Live together or die alone: Block cooperation to extend lifetime of resistive memories MK Tavana, AK Ziabari, D Kaeli Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 …, 2017 | 9 | 2017 |
Visualization of OpenCL application execution on CPU-GPU systems AK Ziabari, R Ubal, D Schaa, D Kaeli Proceedings of the Workshop on Computer Architecture Education, 1-8, 2015 | 6 | 2015 |
Block cooperation: Advancing lifetime of resistive memories by increasing utilization of error correcting codes MK Tavana, AK Ziabari, D Kaeli ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-26, 2018 | 5 | 2018 |
A framework for visualization of OpenCL applications execution: a tutorial AK Ziabari, RU Tena, D Schaa, D Kaeli Proceedings of the 3rd International Workshop on OpenCL, 1-2, 2015 | 2 | 2015 |
Improving the global memory efficiency in GPU-based systems AK Ziabari Northeastern University, 2016 | 1 | 2016 |
A Framework for Visualization of OpenCL Applications Execution A Ziabari, R Ubal, D Schaa, D Kaeli | | |
ICCD 2020 D Kaeli, T Carlson, M Geier, Y Sun, J Su, O Plata, H Jiang, G Byrd, ... | | |
Evaluation of Volta-based DGX-1 System Using DNN Workloads SA Mojumder, MS Louis, Y Sun, AK Ziabari, JL Abellán, J Kim, D Kaeli, ... | | |