MSCCL++: Rethinking GPU Communication Abstractions for AI Inference
Changho Hwang, Peng Cheng, Roshan Dathathri, Abhinav Jangda, Saeed Maleki, Madan Musuvathi, Olli Saarikivi, Aashaka Shah, Ziyue Yang, Sreevatsa Anantharamu, Mahdieh Ghazimirsaeed, Jithin Jose, Binyang Li, Caio Rocha, Qinghua Zhou
In the ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
Pittsburgh, USA, March 2026 (to appear)
Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, and Mao Yang
In the 51st International Symposium on Computer Architecture (ISCA)
Buenos Aires, Argentina, June 2024
Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, and Yongqiang Xiong
In the 6th Conference on Machine Learning and Systems (MLSys)
Miami, FL, June 2023
Changho Hwang, KyoungSoo Park, Ran Shu, Xinyuan Qu, Peng Cheng, and Yongqiang Xiong
In the 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI)
Boston, MA, April 2023
Changho Hwang, Taehyun Kim, Sunghyun Kim, Jinwoo Shin, and KyoungSoo Park
In the 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI)
Virtual Event, April 2021
Kimin Lee, Changho Hwang, KyoungSoo Park, and Jinwoo Shin
In the 34th International Conference on Machine Learning (ICML)
Sydney, Australia, August 2017
Younghwan Go, Muhammad Jamshed, YoungGyoun Moon, Changho Hwang, and KyoungSoo Park
In the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI)
Boston, MA, March 2017