Update README.md
README.md
CHANGED
@@ -11,6 +11,9 @@ tags:
 base_model:
 - Qwen/Qwen2.5-72B-Instruct
 ---
+> [!NOTE]
+> EXL2 4.65bpw-h6 quantized version of [Nexusflow/Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat). Supports 32K context with Q4 cache on systems with 48 GB VRAM.
+
 # Athene-V2-Chat-72B: Rivaling GPT-4o across Benchmarks
 
 <p align="center">
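The note added in this commit claims that 32K context fits in 48 GB of VRAM with a Q4 cache. A rough back-of-the-envelope check of that claim might look like the sketch below. Only the 4.65 bits per weight, the 4-bit cache, and the 32K context come from the note; the parameter count, layer count, KV head count, and head dimension are assumptions about the base model and are not stated in this diff. The estimate also ignores activation buffers, the 6-bit output head, cache scale overhead, and framework overhead, so real usage will be somewhat higher.

```python
# Rough VRAM estimate for the quantized model described in the note.
# Values marked "assumed" are NOT taken from the commit; adjust as needed.

GIB = 1024 ** 3

PARAMS = 72.7e9          # assumed parameter count of the base model
BITS_PER_WEIGHT = 4.65   # from the note: EXL2 4.65bpw
LAYERS = 80              # assumed number of transformer layers
KV_HEADS = 8             # assumed number of key/value heads
HEAD_DIM = 128           # assumed dimension per attention head
CACHE_BITS = 4           # from the note: Q4 cache (scale overhead ignored)
CONTEXT = 32 * 1024      # from the note: 32K tokens


def weights_bytes(params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in bytes."""
    return params * bits_per_weight / 8


def kv_cache_bytes(layers: int, kv_heads: int, head_dim: int,
                   context: int, bits: int) -> float:
    """Approximate size of the KV cache in bytes (factor 2 = keys + values)."""
    return 2 * layers * kv_heads * head_dim * context * bits / 8


weights = weights_bytes(PARAMS, BITS_PER_WEIGHT)
cache = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, CONTEXT, CACHE_BITS)
total_gib = (weights + cache) / GIB

print(f"weights: {weights / GIB:.1f} GiB")
print(f"cache:   {cache / GIB:.1f} GiB")
print(f"total:   {total_gib:.1f} GiB")
```

Under these assumptions the weights dominate and the quantized cache adds only a few GiB, which is consistent with the note's 48 GB figure while leaving limited headroom for everything the estimate leaves out.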