Publications

(2023). Direct Telemetry Access. In SIGCOMM ‘23.

PDF Cite Code DOI

(2023). SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification. In ArXiv.

PDF Cite Code DOI

(2021). Zero-CPU Collection with Direct Telemetry Access. In HotNets ‘21.

PDF Cite DOI