Snoopy04 commited on
Commit
3722358
1 Parent(s): 077a49d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -55,7 +55,7 @@ Install **vLLM** and
55
  run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
56
 
57
  ```
58
- python -m vllm.entrypoints.openai.api_server --model cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic
59
  ```
60
  Access the model:
61
  ```
 
55
  run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
56
 
57
  ```
58
+ python -m vllm.entrypoints.openai.api_server --model cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic --max-model-len 9000 --gpu-memory-utilization 0.95
59
  ```
60
  Access the model:
61
  ```