hi, is it possible to have a speculative config with vllm backend? if yes are there any examples to run it? Thanks
hi,
is it possible to have a speculative config with vllm backend? if yes are there any examples to run it?
Thanks