Skip to content
Snippets Groups Projects
Unverified Commit abc9c9c3 authored by Ren Xuancheng's avatar Ren Xuancheng Committed by GitHub
Browse files

add vllm version warning (#1043)

parent e885e98f
No related branches found
No related tags found
No related merge requests found
...@@ -172,6 +172,9 @@ Qwen2.5 is supported by multiple inference frameworks. Here we demonstrate the u ...@@ -172,6 +172,9 @@ Qwen2.5 is supported by multiple inference frameworks. Here we demonstrate the u
### vLLM ### vLLM
> [!Warning]
> Not working with vllm 0.6.3.
We advise you to use the latest version of vLLM to build OpenAI-compatible API service, including tool use support. Start the server with a chat model, e.g. `Qwen2.5-7B-Instruct`: We advise you to use the latest version of vLLM to build OpenAI-compatible API service, including tool use support. Start the server with a chat model, e.g. `Qwen2.5-7B-Instruct`:
```shell ```shell
vllm serve Qwen/Qwen2.5-7B-Instruct vllm serve Qwen/Qwen2.5-7B-Instruct
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment