A survey on coarse-grained reconfigurable architectures from a performance perspective A Podobas, K Sano, S Matsuoka IEEE Access 8, 146719-146743, 2020 | 147 | 2020 |
Combined spatial and temporal blocking for high-performance stencil computation on FPGAs using OpenCL HR Zohouri, A Podobas, S Matsuoka Proceedings of the 2018 ACM/SIGDA International Symposium on Field …, 2018 | 109 | 2018 |
A comparison of some recent task-based parallel programming models A Podobas, M Brorsson, KF Faxén 3rd workshop on programmability issues for multi-core computers, 2010 | 73 | 2010 |
Hardware implementation of POSITs and their application in FPGAs A Podobas, S Matsuoka 2018 IEEE International Parallel and Distributed Processing Symposium …, 2018 | 66 | 2018 |
A review on parallel virtual screening softwares for high-performance computers NA Murugan, A Podobas, D Gadioli, E Vitali, G Palermo, S Markidis Pharmaceuticals 15 (1), 63, 2022 | 60 | 2022 |
Matrix engines for high performance computing: A paragon of performance or grasping at straws? J Domke, E Vatai, A Drozd, P ChenT, Y Oyama, L Zhang, S Salaria, ... 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021 | 39 | 2021 |
A template-based framework for exploring coarse-grained reconfigurable architectures A Podobas, K Sano, S Matsuoka 2020 IEEE 31st International Conference on Application-specific Systems …, 2020 | 38 | 2020 |
Neko: A modern, portable, and scalable framework for high-fidelity computational fluid dynamics N Jansson, M Karp, A Podobas, S Markidis, P Schlatter Computers & Fluids 275, 106243, 2024 | 34 | 2024 |
Grain graphs: OpenMP performance analysis made easy A Muddukrishna, PA Jonsson, A Podobas, M Brorsson Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of …, 2016 | 32 | 2016 |
Benchmarking the nvidia gpu lineage: From early k80 to modern a100 with asynchronous memory transfers M Svedin, SWD Chien, G Chikafa, N Jansson, A Podobas Proceedings of the 11th International Symposium on Highly Efficient …, 2021 | 29 | 2021 |
A comparative performance study of common and popular task‐centric programming frameworks A Podobas, M Brorsson, KF Faxén Concurrency and Computation: Practice and Experience 27 (1), 1-28, 2015 | 25* | 2015 |
Double-precision fpus in high-performance computing: an embarrassment of riches? J Domke, K Matsumura, M Wahib, H Zhang, K Yashima, T Tsuchikawa, ... 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 23 | 2019 |
High-performance high-order stencil computation on FPGAs using OpenCL HR Zohouri, A Podobas, S Matsuoka 2018 IEEE International Parallel and Distributed Processing Symposium …, 2018 | 22 | 2018 |
Evaluating high-level design strategies on FPGAs for high-performance computing A Podobas, HR Zohouri, N Maruyama, S Matsuoka 2017 27th International Conference on Field Programmable Logic and …, 2017 | 22 | 2017 |
tf-Darshan: Understanding fine-grained I/O performance in machine learning workloads SWD Chien, A Podobas, IB Peng, S Markidis 2020 IEEE International Conference on Cluster Computing (CLUSTER), 359-370, 2020 | 21 | 2020 |
Towards Unifying OpenMP Under the Task-Parallel Paradigm: Implementation and Performance of the taskloop Construct A Podobas, S Karlsson OpenMP: Memory, Devices, and Tasks: 12th International Workshop on OpenMP …, 2016 | 20 | 2016 |
MACC: An OpenACC transpiler for automatic multi-GPU use K Matsumura, M Sato, T Boku, A Podobas, S Matsuoka Supercomputing Frontiers: 4th Asian Conference, SCFA 2018, Singapore, March …, 2018 | 19 | 2018 |
TurboBŁYSK: scheduling for improved data-driven task performance with fast dependency resolution A Podobas, M Brorsson, V Vlassov Using and Improving OpenMP for Devices, Tasks, and More: 10th International …, 2014 | 18 | 2014 |
Strong scaling of OpenACC enabled Nek5000 on several GPU based HPC systems J Vincent, J Gong, M Karp, A Peplinski, N Jansson, A Podobas, A Jocksch, ... International Conference on High Performance Computing in Asia-Pacific …, 2022 | 17 | 2022 |
High-performance spectral element methods on field-programmable gate arrays: implementation, evaluation, and future projection M Karp, A Podobas, N Jansson, T Kenter, C Plessl, P Schlatter, S Markidis 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021 | 17 | 2021 |