Improving performance of sparse matrix dense matrix multiplication on large-scale parallel systems S Acer, O Selvitopi, C Aykanat Parallel Computing 59, 71-96, 2016 | 51 | 2016 |
Kokkos kernels: Performance portable sparse/dense linear algebra and graph kernels S Rajamanickam, S Acer, L Berger-Vergiat, V Dang, N Ellingwood, ... arXiv preprint arXiv:2103.11991, 2021 | 37 | 2021 |
Improving medium-grain partitioning for scalable sparse tensor decomposition S Acer, T Torun, C Aykanat IEEE Transactions on Parallel and Distributed Systems 29 (12), 2814-2825, 2018 | 18 | 2018 |
Scalable triangle counting on distributed-memory systems S Acer, A Yaşar, S Rajamanickam, M Wolf, ÜV Catalyürek 2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-5, 2019 | 17 | 2019 |
Sphynx: A parallel multi-GPU graph partitioner for distributed-memory systems S Acer, EG Boman, CA Glusa, S Rajamanickam Parallel Computing 106, 102769, 2021 | 16 | 2021 |
A recursive hypergraph bipartitioning framework for reducing bandwidth and latency costs simultaneously O Selvitopi, S Acer, C Aykanat IEEE Transactions on Parallel and Distributed Systems 28 (2), 345-358, 2016 | 15 | 2016 |
EXAGRAPH: Graph and combinatorial methods for enabling exascale applications S Acer, A Azad, EG Boman, A Buluç, KD Devine, SM Ferdous, ... The International Journal of High Performance Computing Applications 35 (6 …, 2021 | 14 | 2021 |
Partitioning models for general medium-grain parallel sparse tensor decomposition MO Karsavuran, S Acer, C Aykanat IEEE Transactions on Parallel and Distributed Systems 32 (1), 147-159, 2020 | 14 | 2020 |
SPHYNX: Spectral Partitioning for HYbrid aNd aXelerator-enabled systems S Acer, EG Boman, S Rajamanickam 2020 IEEE International Parallel and Distributed Processing Symposium …, 2020 | 14 | 2020 |
A recursive bipartitioning algorithm for permuting sparse square matrices into block diagonal form with overlap S Acer, E Kayaaslan, C Aykanat SIAM Journal on Scientific Computing 35 (1), C99-C121, 2013 | 13 | 2013 |
Performance-portable graph coarsening for efficient multilevel graph analysis MS Gilbert, S Acer, EG Boman, K Madduri, S Rajamanickam 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021 | 9 | 2021 |
Optimizing nonzero-based sparse matrix partitioning models via reducing latency S Acer, O Selvitopi, C Aykanat Journal of Parallel and Distributed Computing 122, 145-158, 2018 | 9 | 2018 |
A hypergraph partitioning model for profile minimization S Acer, E Kayaaslan, C Aykanat SIAM Journal on Scientific Computing 41 (1), A83-A108, 2019 | 7 | 2019 |
Reduce operations: Send volume balancing while minimizing latency MO Karsavuran, S Acer, C Aykanat IEEE Transactions on Parallel and Distributed Systems 31 (6), 1461-1473, 2020 | 4 | 2020 |
True load balancing for matricized tensor times Khatri-Rao product N Abubaker, S Acer, C Aykanat IEEE Transactions on Parallel and Distributed Systems 32 (8), 1974-1986, 2021 | 3 | 2021 |
Reordering sparse matrices into block-diagonal column-overlapped form S Acer, C Aykanat Journal of Parallel and Distributed Computing 140, 99-109, 2020 | 2 | 2020 |
A recursive graph bipartitioning algorithm by vertex separators with fixed vertices for permuting sparse matrices into block diagonal form with overlap S Acer PQDT-Global, 2011 | 2 | 2011 |
Addressing Volume and Latency Overheads in 1D-parallel Sparse Matrix-Vector Multiplication S Acer, O Selvitopi, C Aykanat Euro-Par 2017: Parallel Processing: 23rd International Conference on …, 2017 | 1 | 2017 |
Kokkos v. 4.0 G Mackey, A Powell, D Sunderland, M Hoemmen, S Rajamanickam, ... Sandia National Lab.(SNL-NM), Albuquerque, NM (United States), 2022 | | 2022 |
Sphynx: The first graph partitioner on AMD GPUs. S Acer, E Boman Sandia National Lab.(SNL-NM), Albuquerque, NM (United States), 2022 | | 2022 |