Xupeng Miao,
Gabriele Oliaro,
Zhihao Zhang,
Xinhao Cheng,
Zeyu Wang,
Zhengxin Zhang,
Rae Ying Yee Wong,
Alan Zhu,
Lijie Yang,
Xiaoxiang Shi,
Chunan Shi,
Zhuoming Chen,
Daiyaan Arfeen,
Reyna Abhyankar,
Zhihao Jia
(2024).
SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification.
ASPLOS 2024.
Muyan Hu,
Ashwin Venkatram,
Shreyashri Biswas,
Balamurugan Marimuthu,
Bohan Hou,
Gabriele Oliaro,
Haojie Wang,
Liyan Zheng,
Xupeng Miao,
Jidong Zhai,
Zhihao Jia
(2024).
Optimal Kernel Orchestration for Tensor Programs with Korch.
ASPLOS 2024.
Jonatan Langlet,
Ran Ben Basat,
Gabriele Oliaro,
Michael Mitzenmacher,
Minlan Yu,
Gianni Antichi
(2023).
Direct Telemetry Access.
SIGCOMM 2023.