Gabriele Oliaro
Gabriele Oliaro
Home
Publications
Industry Experience
Contact
CV
Light
Dark
Automatic
3
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software. Create your slides in Markdown - click the Slides button to check out the dta.
Rui Pan
,
Yinwei Dai
,
Zhihao Zhang
,
Gabriele Oliaro
,
Zhihao Jia
,
Ravi Netravali
PDF
Cite
DOI
SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference
Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software. Create your slides in Markdown - click the Slides button to check out the dta.
Gabriele Oliaro
,
Zhihao Jia
,
Daniel Campos
,
Aurick Qiao
PDF
Cite
DOI
Cite
×