Sgap: towards efficient sparse tensor algebra compilation for GPU G Zhang, Y Zhao, Y Tao, Z Yu, G Dai, S Huang, Y Wen, P Petoumenos, ... CCF Transactions on High Performance Computing, 1-18, 2023 | 4 | 2023 |
HyperGef: A Framework Enabling Efficient Fusion for Hypergraph Neural Network on GPUs Z Yu, G Dai, S Yang, G Zhang, H Zhang, F Zhu, J Yang, J Zhao, Y Wang Proceedings of Machine Learning and Systems 5, 2023 | 3 | 2023 |
Canvas: End-to-End Kernel Architecture Search in Neural Networks C Zhao, G Zhang, M Gao arXiv preprint arXiv:2304.07741, 2023 | 1 | 2023 |
FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning K Zhong, Z Zhu, G Dai, H Wang, X Yang, H Zhang, J Si, Q Mao, S Zeng, ... Proceedings of the 29th ACM International Conference on Architectural …, 2024 | | 2024 |
CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models JY Lee, D Lee, G Zhang, M Tiwari, A Mirhoseini arXiv preprint arXiv:2404.08763, 2024 | | 2024 |
Compilation of Modular and General Sparse Workspaces G Zhang, O Hsu, F Kjolstad arXiv preprint arXiv:2404.04541, 2024 | | 2024 |
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU Z Yu, G Zhang, H Huang, X Chen, J Zhao arXiv preprint arXiv:2404.03019, 2024 | | 2024 |