Feature request
Hello! I recently came across this popular OpenAI-compatible inference framework and found it very interesting. I'd like to know more about its concurrency and stability—specifically, how it compares to vLLM.
Motivation
project feature
Your contribution
no