Jiaao He
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Year
FastMoE: A fast mixture-of-expert training system
J He, J Qiu, A Zeng, Z Yang, J Zhai, J Tang
arXiv preprint arXiv:2103.13262, 2021
Cited by: 93; Year: 2021
Prague: High-performance heterogeneity-aware asynchronous decentralized training
Q Luo, J He, Y Zhuo, X Qian
Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020
Cited by: 86; Year: 2020
FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models
J He, J Zhai, T Antunes, H Wang, F Luo, S Shi, Q Li
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
Cited by: 64; Year: 2022
BaGuaLu: targeting brain scale pretrained models with over 37 million cores
Z Ma, J He, J Qiu, H Cao, Y Wang, Z Sun, L Zheng, H Wang, S Tang, ...
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
Cited by: 56; Year: 2022
SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization
M Zhai, J He, Z Ma, Z Zong, R Zhang, J Zhai
2023 USENIX Annual Technical Conference (USENIX ATC 23), 961-975, 2023
Cited by: 20; Year: 2023
FastDecode: High-Throughput LLM Serving through Disaggregating Attention Computation
J He, K Huang, J Zhai
First Workshop on Long-Context Foundation Models @ ICML 2024
Cited by: 9*
Efficiently emulating high-bitwidth computation with low-bitwidth hardware
Z Ma, H Wang, G Feng, C Zhang, L Xie, J He, S Chen, J Zhai
Proceedings of the 36th ACM International Conference on Supercomputing, 1-12, 2022
Cited by: 4; Year: 2022
POSTER: Pattern-Aware Sparse Communication for Scalable Recommendation Model Training
J He, S Chen, J Zhai
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024
Cited by: 2; Year: 2024
Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From Tsinghua University
C Zhang, C Zhao, J He, S Chen, L Zheng, K Huang, W Han, J Zhai
IEEE Transactions on Parallel and Distributed Systems 32 (11), 2631-2634, 2021
Cited by: 2; Year: 2021
Student Cluster Competition 2018, Team Tsinghua University: Reproducing performance of multi-physics simulations of the Tsunamigenic 2004 Sumatra megathrust earthquake on the …
J He, C Zhao, J Yu, X Yu, L Zheng, C Lou, S Tang, W Han, J Zhai
Parallel Computing 90, 102570, 2019
Year: 2019