Gheorghe-Teodor Bercea
Gheorghe-Teodor Bercea
Research Staff Member, IBM Research
Verified email at ibm.com
Title
Cited by
Cited by
Year
Firedrake: automating the finite element method by composing abstractions
F Rathgeber, DA Ham, L Mitchell, M Lange, F Luporini, ATT McRae, ...
ACM Transactions on Mathematical Software (TOMS) 43 (3), 1-27, 2016
3852016
Cross-loop optimization of arithmetic intensity for finite element local assembly
F Luporini, AL Varbanescu, F Rathgeber, GT Bercea, J Ramanujam, ...
ACM Transactions on Architecture and Code Optimization (TACO) 11 (4), 1-25, 2015
722015
Automated generation and symbolic manipulation of tensor product finite elements
ATT McRae, GT Bercea, L Mitchell, DA Ham, CJ Cotter
SIAM Journal on Scientific Computing 38 (5), S25-S47, 2016
562016
Offloading support for OpenMP in Clang and LLVM
SF Antao, A Bataev, AC Jacob, GT Bercea, AE Eichenberger, G Rokos, ...
2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), 1-11, 2016
462016
Integrating GPU support for OpenMP offloading directives into Clang
C Bertolli, SF Antao, GT Bercea, AC Jacob, AE Eichenberger, T Chen, ...
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in …, 2015
382015
A structure-exploiting numbering algorithm for finite elements on extruded meshes, and its performance evaluation in Firedrake. Geoscientific Model Development, 9 (10): 3803 …
GT Bercea, ATT McRae, DA Ham, L Mitchell, F Rathgeber, L Nardi, ...
gmd-9-3803-2016, 2016
34*2016
Performance analysis of OpenMP on a GPU using a CORAL proxy application
GT Bercea, C Bertolli, SF Antao, AC Jacob, AE Eichenberger, T Chen, ...
Proceedings of the 6th International Workshop on Performance Modeling …, 2015
322015
Performance analysis and optimization of Clang's OpenMP 4.5 GPU support
M Martineau, S McIntosh-Smith, C Bertolli, AC Jacob, SF Antao, ...
2016 7th International Workshop on Performance Modeling, Benchmarking and …, 2016
242016
Generalizing run-time tiling with the loop chain abstraction
MM Strout, F Luporini, CD Krieger, C Bertolli, GT Bercea, C Olschanowsky, ...
2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014
182014
Early experiences porting three applications to OpenMP 4.5
I Karlin, T Scogland, AC Jacob, SF Antao, GT Bercea, C Bertolli, ...
International Workshop on OpenMP, 281-292, 2016
172016
COFFEE: an optimizing compiler for finite element local assembly
F Luporini, AL Varbanescu, F Rathgeber, GT Bercea, J Ramanujam, ...
arXiv preprint arXiv:1407.0904, 2014
132014
Efficient fork-join on GPUs through warp specialization
AC Jacob, AE Eichenberger, H Sung, SF Antao, GT Bercea, C Bertolli, ...
2017 IEEE 24th International Conference on High Performance Computing (HiPC …, 2017
102017
Implementing implicit OpenMP data sharing on GPUs
GT Bercea, C Bertolli, AC Jacob, A Eichenberger, A Bataev, G Rokos, ...
Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in …, 2017
62017
Compiling ONNX Neural Network Models Using MLIR
T Jin, GT Bercea, TD Le, T Chen, G Su, H Imai, Y Negishi, A Leu, ...
arXiv preprint arXiv:2008.08272, 2020
42020
firedrake: An automated finite element system
L Mitchell, DA Ham, F Rathgeber, M Homolya, ATT McRae, GT Bercea, ...
Zenodo, 2016
32016
Towards performance portable gpu programming with raja
A Jacob, SF Antao, H Sung, AE Eichenberger, C Bertolli, GT Bercea, ...
Workshop on Portability Among HPC Architectures for Scientific Applications, 2015
22015
An open-source solution to performance portability for Summit and Sierra supercomputers
GT Bercea, A Bataev, AE Eichenberger, C Bertolli, JK O'Brien
IBM Journal of Research and Development 64 (3/4), 12: 1-12: 23, 2019
12019
Firedrake: Re-imagining FEniCS by Composing Domain-specific Abstractions
F Rathgeber, L Mitchell, D Ham, M Lange, A McRae, F Luporini, G Bercea, ...
12014
Hybrid CPU/GPU tasks optimized for concurrency in OpenMP
AE Eichenberger, GT Bercea, A Bataev, L Grinberg, JK O'Brien
IBM Journal of Research and Development 64 (3/4), 13: 1-13: 14, 2019
2019
Sublinear Subwindow Search
M Reuter, GT Bercea
arXiv preprint arXiv:1908.00140, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–20