Update README.md
Browse files
README.md
CHANGED
@@ -55,7 +55,7 @@ Install **vLLM** and
|
|
55 |
run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
|
56 |
|
57 |
```
|
58 |
-
python -m vllm.entrypoints.openai.api_server --model cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic
|
59 |
```
|
60 |
Access the model:
|
61 |
```
|
|
|
55 |
run the [server](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#openai-compatible-server):
|
56 |
|
57 |
```
|
58 |
+
python -m vllm.entrypoints.openai.api_server --model cortecs/Llama-3.3-70B-Instruct-FP8-Dynamic --max-model-len 9000 --gpu-memory-utilization 0.95
|
59 |
```
|
60 |
Access the model:
|
61 |
```
|