ScalAna: Automating scaling loss detection with graph analysis Y Jin, H Wang, T Yu, X Tang, T Hoefler, X Liu, J Zhai SC20: International Conference for High Performance Computing, Networking ..., 2020 | 12 | 2020 |
PerFlow: A domain specific framework for automatic performance analysis of parallel applications Y Jin, H Wang, R Zhong, C Zhang, J Zhai Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of ..., 2022 | 10 | 2022 |
Vapro: Performance variance detection and diagnosis for production-run parallel applications L Zheng, J Zhai, X Tang, H Wang, T Yu, Y Jin, SL Song, W Chen Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of ..., 2022 | 8 | 2022 |
WiseGraph: Optimizing GNN with Joint Workload Partition of Graph and Operations K Huang, J Zhai, L Zheng, H Wang, Y Jin, Q Zhang, R Zhang, Z Zheng, ... Proceedings of the Nineteenth European Conference on Computer Systems, 1-17, 2024 | 3 | 2024 |
Identifying scalability bottlenecks for large-scale parallel programs with graph analysis Y Jin, H Wang, X Tang, T Hoefler, X Liu, J Zhai Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of ..., 2020 | 3 | 2020 |
Detecting performance variance for parallel applications without source code J Zhai, L Zheng, F Zhang, X Tang, H Wang, T Yu, Y Jin, SL Song, W Chen IEEE Transactions on Parallel and Distributed Systems 33 (12), 4239-4255, 2022 | 2 | 2022 |
Parallel program scalability bottleneck detection method and computing device J Zhai, Y Jin, W Chen, W Zheng US Patent 11,768,754, 2023 | 1 | 2023 |
Lightweight Noise Detection J Zhai, Y Jin, W Chen, W Zheng Performance Analysis of Parallel Applications for HPC, 165-197, 2023 | 1 | 2023 |
Unified Programming Models for Heterogeneous High-Performance Computers ZX Ma, YY Jin, SZ Tang, HJ Wang, WC Xue, JD Zhai, WM Zheng Journal of Computer Science and Technology 38 (1), 211-218, 2023 | 1 | 2023 |
Leveraging Graph Analysis to Pinpoint Root Causes of Scalability Issues for Parallel Applications Y Jin, H Wang, X Tang, Z Guo, Y Zhao, T Hoefler, T Liu, X Liu, J Zhai IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |
Efficient Inference for Pruned CNN Models on Mobile Devices With Holistic Sparsity Alignment Y Jin, R Zhong, S Long, J Zhai IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |
BoostN: Optimizing Imbalanced Neighborhood Communication on Homogeneous Many-Core System H Huang, Y Jin, W Xue Proceedings of the 53rd International Conference on Parallel Processing, 262-272, 2024 | | 2024 |
Graph-Centric Performance Analysis for Large-Scale Parallel Applications Y Jin, H Wang, R Zhong, C Zhang, X Liao, F Zhang, J Zhai IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |
Efficient Asynchronous Performance Prediction for Heterogeneous Systems Y Jin, Z Ma, J Zhai Chinese Journal of Computational Physics 41 (1), 40, 2024 | | 2024 |
PUZZLE: Efficiently Aligning Large Language Models through Light-Weight Context Switch K Lei, Y Jin, M Zhai, K Huang, H Ye, J Zhai 2024 USENIX Annual Technical Conference (USENIX ATC 24), 127-140, 2024 | | 2024 |
Performance Analysis of Parallel Applications for HPC J Zhai, Y Jin, W Chen, W Zheng Springer, 2023 | | 2023 |
Graph Analysis for Scalability Analysis J Zhai, Y Jin, W Chen, W Zheng Performance Analysis of Parallel Applications for HPC, 101-128, 2023 | | 2023 |
Informed Memory Access Monitoring J Zhai, Y Jin, W Chen, W Zheng Performance Analysis of Parallel Applications for HPC, 73-97, 2023 | | 2023 |
Background and Overview J Zhai, Y Jin, W Chen, W Zheng Performance Analysis of Parallel Applications for HPC, 1-5, 2023 | | 2023 |
Fast Communication Trace Collection J Zhai, Y Jin, W Chen, W Zheng Performance Analysis of Parallel Applications for HPC, 9-41, 2023 | | 2023 |