Qwen
/

Qwen3-235B-A22B-Thinking-2507

Text Generation

Model card Files Files and versions

hzhwcmhf commited on Aug 17

Commit

6cbffae

·

verified ·

1 Parent(s): ddee1c5

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -249,9 +249,9 @@ After updating the config, proceed with either **vLLM** or **SGLang** for servin
 To run Qwen with 1M context support:
 ```bash
-git clone https://github.com/vllm-project/vllm.git
-cd vllm
-pip install -e .
 ```
 Then launch the server with Dual Chunk Flash Attention enabled:

 To run Qwen with 1M context support:
 ```bash
+pip install -U vllm \
+    --torch-backend=auto \
+    --extra-index-url https://wheels.vllm.ai/nightly
 ```
 Then launch the server with Dual Chunk Flash Attention enabled: