Bio
I am currently a postdoctoral research fellow at the National University of Singapore (NUS), collaborating with Prof. Jin Song Dong. I earned my Ph.D. from Shanghai Jiao Tong University under the supervision of Prof. Jingwen Leng.
My research interests include machine learning systems, high-performance computing, and computer architecture. I am passionate about exploring innovative solutions in these fields and contributing to the advancement of technology.
News
- [2025.09] Awarded the CCF Outstanding Doctoral Dissertation Award in Computer Architecture Nomination.
- [2025.09] Two of our papers (ClusterFusion, Yggdrasil) were accepted to NeurIPS 2025.
- [2025.09] Our paper SWITCHBLADE was accepted to TCAD.
- [2025.06] Our paper Helix was accepted to SC 2025.
- [2025.04] Our paper Voyager was accepted to ASPLOS 2025.
- [2024.12] Started postdoc at NUS.
- [2024.11] Our paper VQ-LLM was accepted to HPCA 2025.
- [2023.09] I received my Ph.D. from Shanghai Jiao Tong University.
Education
- Ph.D., Shanghai Jiao Tong University, 2023
- B.S., Huazhong University of Science and Technology, 2018
Work Experience
- 2023.10 - 2024.10: System Developer/Researcher
  - Tencent AI Lab
- Summer 2022: Research Intern
  - Alibaba Cloud PAI
- Summer 2021: Research Intern
  - Qi Zhi Institute
- Spring 2021: Research Intern
  - Peng Cheng Laboratory
- Summer 2020: Research Intern
  - T-Head Alibaba
Publications
[NeurIPS’25] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive
Xinhao Luo, Zihan Liu*, Yangjie Zhou*, Shihan Fang, Ziyu Huang, Yu Feng, Chen Zhang, Shixuan Sun, Zhenzhe Zheng, Jingwen Leng, Minyi Guo
The Thirty-Ninth Annual Conference on Neural Information Processing Systems · (*Corresponding authors)
[NeurIPS’25] Yggdrasil: Bridging Dynamic Speculation and Static Runtime for Latency-Optimal Tree-Based LLM Decoding
Yue Guan, Changming Yu, Shihan Fang, Weiming Hu, Zaifeng Pan, Zheng Wang, Zihan Liu, Yangjie Zhou, Yufei Ding, Minyi Guo, Jingwen Leng
The Thirty-Ninth Annual Conference on Neural Information Processing Systems
[TCAD’25] A Full-Stack Framework for GNN Acceleration via Partition-Compiler-Architecture Co-Design
Yangjie Zhou, Zhihui Zhang, Shuwen Lu, Cong Guo, Jingwen Leng, Feng Zhang, Yufei Ma, Yun Liang, Minyi Guo
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
[ASPLOS’25] Voyager: Input-Adaptive Algebraic Transformations for High-Performance Graph Neural Networks
Yangjie Zhou, Wenting Shen, Jingwen Leng, Shuwen Lu, Zihan Liu, Weihao Cui, Zhendong Zhang, Wencong Xiao, Baole Ai, Yong Li, Wei Lin, Deze Zeng, Yun Liang, Quan Chen, Ning Liu, Minyi Guo
The 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
[SC’25] A Sample-Free Compilation Framework for Efficient Dynamic Tensor Computation
Yangjie Zhou, Honglin Zhu, Qian Qiu, Weihao Cui, Zihan Liu, Peng Chen, Mohamed Wahib, Cong Guo, Siyuan Feng, Jintao Meng, Haidong Lan, Jingwen Leng, Yun Lin, Jin Song Dong, Wenxi Zhu, Minwen Deng
The International Conference for High Performance Computing, Networking, Storage and Analysis
[HPCA’25] VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference
Zihan Liu, Xinhao Luo, Junxian Guo, Wentao Ni, Yangjie Zhou, Yue Guan, Cong Guo, Weihao Cui, Yu Feng, Minyi Guo, Yuhao Zhu, Minjia Zhang, Jingwen Leng, Chen Jin
The 31st IEEE International Symposium on High-Performance Computer Architecture
[ASPLOS’24] Fractal: Joint Multi-Level Sparse Pattern Tuning of Accuracy and Performance for DNN Pruning
Yue Guan, Changming Yu, Yangjie Zhou, Jingwen Leng, Chao Li, Minyi Guo
The 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
[ASPLOS’23] uGrapher: High-Performance Graph Operator Computation via Unified Abstraction for Graph Neural Networks
Yangjie Zhou, Jingwen Leng, Yaoxu Song, Shuwen Lu, Mian Wang, Chao Li, Minyi Guo, Wenting Shen, Yong Li, Wei Lin, Xiangwen Liu, Hanqing Wu
The 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
[CF’23] AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs
Yangjie Zhou, Yaoxu Song, Jingwen Leng, Zihan Liu, Weihao Cui, Zhendong Zhang, Cong Guo, Quan Chen, Li Li, Minyi Guo
The 20th ACM International Conference on Computing Frontiers
[CF’23] DistSim: A performance model of large-scale hybrid distributed DNN training
Guandong Lu, Runzhe Chen, Yakai Wang, Yangjie Zhou, Rui Zhang, Zheng Hu, Yanming Miao, Zhifang Cai, Li Li, Jingwen Leng, Minyi Guo
The 20th ACM International Conference on Computing Frontiers
[Arxiv] Efficient Adaptive Activation Rounding for Post-Training Quantization
Zhengyi Li, Cong Guo, Zhanda Zhu, Yangjie Zhou, Yuxian Qiu, Xiaotian Gao, Jingwen Leng, Minyi Guo
[IISWC’21] Characterizing and demystifying the implicit convolution algorithm on commercial matrix-multiplication accelerators
Yangjie Zhou, Mengtian Yang, Cong Guo, Jingwen Leng, Yun Liang, Quan Chen, Minyi Guo, Yuhao Zhu
IEEE International Symposium on Workload Characterization
[DAC’20] TPUSim: ISA Design and Optimization for Fused Architecture Based Training Accelerator
Yangjie Zhou, Jingwen Leng, Mengtian Yang, Zhihui Zhang, Yakai Wang, Chen Zhang, Minyi Guo, Yuhao Zhu
ACM/IEEE Design Automation Conference (Poster)
[DAC’20] Balancing efficiency and flexibility for DNN acceleration via temporal GPU-systolic array integration
Cong Guo, Yangjie Zhou, Jingwen Leng, Yuhao Zhu, Zidong Du, Quan Chen, Chao Li, Bin Yao, Minyi Guo
ACM/IEEE Design Automation Conference
Honors and Services
- 2025 CCF Outstanding Doctoral Dissertation Award in Computer Architecture Nomination
- 2025 International Conference on Computer Science and Application Engineering, Reviewer
- 2025 OSDI’25, Artifact Evaluation Committee
- 2025 ACM Transactions on Internet Technology, Reviewer
- 2024 OSDI’24, Artifact Evaluation Committee
- 2024 ATC’24, Artifact Evaluation Committee
- 2023 MLSys’23, Artifact Evaluation Committee
- 2022 SJTU JHC Excellent Doctoral Academic Forum First Prize Scholarship
- 2020 DAC’20 Richard Newton Young Student Fellow
