Qingxiao Sun, Super Scientific Software Laboratory, China University of Petroleum-Beijing

Qingxiao Sun

Associate Professor

qingxiao.sun@cup.edu.cn

Short Bio

Qingxiao Sun is currently an associate professor at the China University of Petroleum-Beijing. He was awarded with ACM SIGHPC China Doctoral Disseration Award and CCF TCARCH Doctoral Disseration Award. He received his PhD in 2023 from Beihang University under supervision of Prof. Yi Liu and Asso. Prof. Hailong Yang. His research interests include high performance computing, computer architecture, deep learning system and parallel computing. His recent research invetigates performance auto-tuning, GPU architecture extension, runtime mechanism and graph neural network training. He has authored about 20 publications in the leading international journals and conferences. His papers have been selected as CLUSTER '21 Best Paper Nomination and IEEE Computer's "Spotlight on Transactions". He currently serves as reviewers in the premier journals including TPDS, TC, TCC and THPC.

Publications

[TPDS] Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian. Adaptive Auto-tuning Framework for Global Exploration of Stencil Optimization on GPUs. IEEE Transactions on Parallel and Distributed Systems. 2024. [PDF] [DOI] [Bibtex] [Code]
[IPDPS '23] Jianjin Liao, Mingzhen Li, Hailong Yang, Qingxiao Sun, Biao Sun, Jiwei Hao, Tianyu Feng, Fengwei Yu, Shengdong Chen, Ye Tao, Zicheng Zhang, Zhongzhi Luan, Depei Qian. Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU. 37th IEEE International Parallel and Distributed Processing Symposium. 2023. [PDF] [Slides] [DOI] [Bibtex] [Code]
[SC '22] Qingxiao Sun, Yi Liu, Hailong Yang, Ruizhe Zhang, Ming Dun, Mingzhen Li, Xiaoyan Liu, Wencong Xiao, Yong Li, Zhongzhi Luan, Depei Qian. CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs. 35th International Conference for High Performance Computing, Networking, Storage, and Analysis. 2022. [PDF] [Slides] [DOI] [Bibtex] [Code]
[IPDPS '22] Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Zhongzhi Luan, Depei Qian. StencilMART: Predicting Optimization Selection for Stencil Computations Across GPUs. 36th IEEE International Parallel and Distributed Processing Symposium. 2022. [PDF] [Slides] [DOI] [Bibtex] [Code]
[HPCC '22] Jiwei Hao, Hailong Yang, Qingxiao Sun, Huaitao Zhang, Zhongzhi Luan, Depei Qian. Towards Optimized Streaming Tensor Completion on Multiple GPUs. 24th IEEE International Conference on High Performance Computing and Communications. 2022. [PDF] [Slides] [DOI] [Bibtex] [Code]
[PARCO] Qingxiao Sun, Liu Yi, Hailong Yang, Mingzhen Li, Zhongzhi Luan, Depei Qian. QoS-aware Dynamic Resource Allocation with Improved Utilization and Energy Efficiency on GPU. Parallel Computing. 2022. [PDF] [DOI] [Bibtex] [Code]
[TC] Qingxiao Sun, Yi Liu, Hailong Yang, Ming Dun, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian. Input-Aware Sparse Tensor Storage Format Selection for Optimizing MTTKRP. IEEE Transactions on Computers. 2021. IEEE Computer's "Spotlight on Transactions" column. [PDF] [DOI] [Bibtex] [Code]
[ICS '21] Ming Dun, Yunchun Li, Hailong Yang, Qingxiao Sun, Zhongzhi Luan, Depei Qian. An Optimized Tensor Completion Library for Multiple GPUs. 35th ACM International Conference on Supercomputing. 2021. [PDF] [Slides] [DOI] [Bibtex] [Code]
[ICPP '21] Mingzhen Li, Yi Liu, Hailong Yang, Yongmin Hu, Qingxiao Sun, Bangduo Chen, Xin You, Xiaoyan Liu, Zhongzhi Luan, Depei Qian. Automatic Code Generation and Optimization of Large-scale Stencil Computation on Many-core Processors. 50th International Conference on Parallel Processing. 2021. [PDF] [Slides] [DOI] [Bibtex] [Code]
[CLUSTER '21] Qingxiao Sun, Yi Liu, Hailong Yang, Zhonghui Jiang, Xiaoyan Liu, Ming Dun, Zhongzhi Luan, Depei Qian. csTuner: Scalable Auto-tuning Framework for Complex Stencil Computation on GPUs. 23rd IEEE International Conference on Cluster Computing. 2021. Best Paper Finalist. [PDF] [Slides] [DOI] [Bibtex] [Code]
[SC '20] Qingxiao Sun, Yi Liu, Ming Dun, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian. SpTFS: Sparse Tensor Format Selection for MTTKRP via Deep Learning. 33th International Conference for High Performance Computing, Networking, Storage, and Analysis. 2020. [PDF] [Slides] [DOI] [Bibtex] [Code]
[TPDS] Mingzhen Li , Yi Liu , Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian. The Deep Learning Compiler: A Comprehensive Survey. IEEE Transactions on Parallel and Distributed Systems. 2020. [PDF] [DOI] [Bibtex] [Code]
[Information Sciences] Ming Dun, Yunchun Li, Qingxiao Sun, Hailong Yang, Wei Li, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian. Towards Efficient Canonical Polyadic Decomposition on Sunway Many-core Processor. Information Sciences. 2020. [PDF] [DOI] [Bibtex] [Code]
[FGCS] Zhiyong Xiao, Xu Liu, Jingheng Xu, Qingxiao Sun, Lin Gan. Highly Scalable Parallel Genetic Algorithm on Sunway Many-core Processors. Future Generation Computer Systems. 2020. [PDF] [DOI] [Bibtex] [Code]
[CLUSTER '19] Qingxiao Sun, Yi Liu, Hailong Yang, Zhongzhi Luan, Depei Qian. SMQoS: Improving Utilization and Energy Efficiency with QoS Awareness on GPUs. 21st IEEE International Conference on Cluster Computing. 2019. [PDF] [Slides] [DOI] [Bibtex] [Code]
[ICA3PP '19] Ming Dun, Yunchun Li, Xin You, Qingxiao Sun, Zerong Luan, Hailong Yang. Accelerating De Novo Assembler WTDBG2 on Commodity Servers. 19th IEEE International Conference on Algorithms and Architectures for Parallel Processing. 2019. [PDF] [Slides] [DOI] [Bibtex] [Code]
[TACO] Chao Yu, Yuebin Bai, Qingxiao Sun, Hailong Yang. Improving Thread-level Parallelism in GPUs Through Expanding Register File to Scratchpad Memory. ACM Transactions on Architecture and Code Optimization. 2018. [PDF] [DOI] [Bibtex] [Code]