Cody (Hao) Yu
Founding Engineer at Boson AI | ex-Amazonian | UCLA PhD ‘19
Verified email at boson.ai
Title
Cited by
Year
Automated systolic array architecture synthesis for high throughput CNN inference on FPGAs
X Wei, CH Yu, P Zhang, Y Chen, Y Wang, H Hu, Y Liang, J Cong
Proceedings of the 54th Annual Design Automation Conference 2017, 1-6, 2017
Cited by: 445; Year: 2017
Ansor: Generating high-performance tensor programs for deep learning
L Zheng, C Jia, M Sun, Z Wu, CH Yu, A Haj-Ali, Y Wang, J Yang, D Zhuo, ...
14th USENIX symposium on operating systems design and implementation (OSDI …, 2020
Cited by: 285; Year: 2020
Efficient memory management for large language model serving with PagedAttention
W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ...
Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023
Cited by: 134; Year: 2023
Programming and runtime support to blaze FPGA accelerator deployment at datacenter scale
M Huang, D Wu, CH Yu, Z Fang, M Interlandi, T Condie, J Cong
Proceedings of the Seventh ACM Symposium on Cloud Computing, 456-469, 2016
Cited by: 111; Year: 2016
HeteroCL: A multi-paradigm programming infrastructure for software-defined reconfigurable computing
YH Lai, Y Chi, Y Hu, J Wang, CH Yu, Y Zhou, J Cong, Z Zhang
Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019
Cited by: 98; Year: 2019
TGPA: Tile-grained pipeline architecture for low latency CNN inference
X Wei, Y Liang, X Li, CH Yu, P Zhang, J Cong
2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2018
Cited by: 72; Year: 2018
Automated accelerator generation and optimization with composable, parallel and pipeline architecture
J Cong, P Wei, CH Yu, P Zhang
Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018
Cited by: 70; Year: 2018
The SMEM Seeding Acceleration for DNA Sequence Alignment
MCF Chang, YT Chen, J Cong, PT Huang, CL Kuo, CH Yu
The 24th IEEE International Symposium on Field-Programmable Custom Computing …, 2016
Cited by: 61; Year: 2016
Bandwidth Optimization Through On-Chip Memory Restructuring for HLS
J Cong, P Wei, CH Yu, P Zhou
Cited by: 60; Year: 2017
AutoDSE: Enabling software programmers to design efficient FPGA accelerators
A Sohrabizadeh, CH Yu, M Gao, J Cong
ACM Transactions on Design Automation of Electronic Systems (TODAES) 27 (4 …, 2022
Cited by: 57; Year: 2022
S2FA: An accelerator automation framework for heterogeneous computing in datacenters
CH Yu, P Wei, M Grossman, P Zhang, V Sarker, J Cong
Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018
Cited by: 41; Year: 2018
On the preconditioner of conjugate gradient method: a power grid simulation perspective
CH Chou, NY Tsai, H Yu, CR Lee, Y Shi, SC Chang
Proceedings of the International Conference on Computer-Aided Design, 494-497, 2010
Cited by: 37; Year: 2010
Best-effort FPGA programming: A few steps can go a long way
J Cong, Z Fang, Y Hao, P Wei, CH Yu, C Zhang, P Zhou
arXiv preprint arXiv:1807.01340, 2018
Cited by: 33; Year: 2018
TensorIR: An abstraction for automatic tensorized program optimization
S Feng, B Hou, H Jin, W Lin, J Shao, R Lai, Z Ye, L Zheng, CH Yu, Y Yu, ...
Proceedings of the 28th ACM International Conference on Architectural …, 2023
Cited by: 25; Year: 2023
Analysis and optimization of the implicit broadcasts in FPGA HLS to improve maximum frequency
L Guo, J Lau, Y Chi, J Wang, CH Yu, Z Chen, Z Zhang, J Cong
2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020
Cited by: 23; Year: 2020
Latte: Locality Aware Transformation for High-Level Synthesis
J Cong, P Wei, CH Yu, P Zhou
Cited by: 23; Year: 2018
Heterogeneous datacenters: Options and opportunities
J Cong, M Huang, D Wu, CH Yu
Proceedings of the 53rd Annual Design Automation Conference, 1-6, 2016
Cited by: 23; Year: 2016
Useful-skew clock optimization for multi-power mode designs
HM Chou, H Yu, SC Chang
2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 647-650, 2011
Cited by: 22; Year: 2011
DietCode: Automatic optimization for dynamic tensor programs
B Zheng, Z Jiang, CH Yu, H Shen, J Fromm, Y Liu, Y Wang, L Ceze, ...
Proceedings of Machine Learning and Systems 4, 848-863, 2022
Cited by: 21; Year: 2022
Customizable computing—from single chip to datacenters
J Cong, Z Fang, M Huang, P Wei, D Wu, CH Yu
Proceedings of the IEEE 107 (1), 185-203, 2018
Cited by: 17; Year: 2018
Articles 1–20