Multi2Sim: A simulation framework for CPU-GPU computing R Ubal, B Jang, P Mistry, D Schaa, D Kaeli Proceedings of the 21st international conference on Parallel architectures …, 2012 | 642 | 2012 |
Multi2sim: A simulation framework to evaluate multicore-multithreaded processors R Ubal, J Sahuquillo, S Petit, P Lopez 19th International Symposium on Computer Architecture and High Performance …, 2007 | 251 | 2007 |
MGPUSim: Enabling multi-GPU performance modeling and optimization Y Sun, T Baruah, SA Mojumder, S Dong, X Gong, S Treadway, Y Bao, ... Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 113 | 2019 |
Statistical fault injection-based AVF analysis of a GPU architecture N Farazmand, R Ubal, D Kaeli Proceedings of SELSE 12, 1-6, 2012 | 60 | 2012 |
Leveraging silicon-photonic noc for designing scalable gpus AKK Ziabari, JL Abellán, R Ubal, C Chen, A Joshi, D Kaeli Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015 | 44 | 2015 |
UMH: A hardware-based unified memory hierarchy for systems with multiple discrete GPUs AK Ziabari, Y Sun, Y Ma, D Schaa, JL Abellán, R Ubal, J Kim, A Joshi, ... ACM Transactions on Architecture and Code Optimization (TACO) 13 (4), 1-25, 2016 | 39 | 2016 |
Multi2Sim Kepler: A detailed architectural GPU simulator X Gong, R Ubal, D Kaeli 2017 IEEE International Symposium on Performance Analysis of Systems and …, 2017 | 36 | 2017 |
A complexity-effective out-of-order retirement microarchitecture SP Marti, JS Borras, PL Rodriguez, RU Tena, JD Marin IEEE Transactions on computers 58 (12), 1626-1639, 2009 | 29 | 2009 |
TwinKernels: an execution model to improve GPU hardware scheduling at compile time X Gong, Z Chen, AK Ziabari, R Ubal, D Kaeli 2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017 | 24 | 2017 |
Exploring the heterogeneous design space for both performance and reliability R Ubal, D Schaa, P Mistry, X Gong, Y Ukidave, Z Chen, G Schirner, ... Proceedings of the 51st Annual Design Automation Conference, 1-6, 2014 | 20 | 2014 |
Mgsim+ mgmark: A framework for multi-gpu system research Y Sun, T Baruah, SA Mojumder, S Dong, R Ubal, X Gong, S Treadway, ... arXiv preprint arXiv:1811.02884, 2018 | 14 | 2018 |
Leakage current reduction in data caches on embedded systems R Ubal, J Sahuquillo, S Petit, H Hassan, P Lopez The 2007 International Conference on Intelligent Pervasive Computing (IPC …, 2007 | 13 | 2007 |
Efficient register renaming and recovery for high-performance processors S Petit, R Ubal, J Sahuquillo, P López IEEE Transactions on Very Large Scale Integration (VLSI) Systems 22 (7 …, 2013 | 11 | 2013 |
Hardware support for local memory transactions on gpu architectures A Villegas, A Navarro, R Asenjo, O Plata, R Ubal, D Kaeli Proc. ACM SIGPLAN Workshop Transactional Comput, 1-9, 2015 | 8 | 2015 |
The multi2sim simulation framework: A cpu-gpu model for heterogeneous computing R Ubal, J Sahuquillo, S Petit, P Lopez, Z Chen, DR Kaeli | 8 | 2015 |
Power reduction in advanced embedded ipc processors R Ubal, J Sahuquillo, S Petit, H Hassan, P López Intelligent Automation & Soft Computing 15 (3), 495-507, 2009 | 8 | 2009 |
Visualization of OpenCL application execution on CPU-GPU systems AK Ziabari, R Ubal, D Schaa, D Kaeli Proceedings of the Workshop on Computer Architecture Education, 1-8, 2015 | 6 | 2015 |
The multi2sim simulation framework R Ubal, B Jang, P Mistry, D Sachaa, D Kaeli International Conference on Parallel Architectures and Compilation …, 2010 | 6 | 2010 |
An efficient low-complexity alternative to the rob for out-of-order retirement of instructions S Petit, R Ubal, J Sahuquillo, P Lopez, J Duato 2009 12th Euromicro Conference on Digital System Design, Architectures …, 2009 | 6 | 2009 |
Hardware support for scratchpad memory transactions on GPU architectures A Villegas, R Asenjo, A Navarro, O Plata, R Ubal, D Kaeli Euro-Par 2017: Parallel Processing: 23rd International Conference on …, 2017 | 5 | 2017 |