wolfram committed on
Commit
1502954
1 Parent(s): 894fb07

Update README.md

  1. README.md +3 -0
README.md CHANGED
@@ -11,6 +11,9 @@ tags:
 base_model:
 - Qwen/Qwen2.5-72B-Instruct
 ---
+> [!NOTE]
+> EXL2 4.65bpw-h6 quantized version of [Nexusflow/Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat). Supports 32K context with Q4 cache on systems with 48 GB VRAM.
+
 # Athene-V2-Chat-72B: Rivaling GPT-4o across Benchmarks
 
 <p align="center">
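
A rough back-of-the-envelope check of the 48 GB claim in the added note, as a sketch only. The model-shape numbers (about 72.7B parameters, 80 layers, 8 KV heads, head dimension 128) are assumptions taken from the publicly listed Qwen2.5-72B configuration, not from this commit. The estimate also ignores the 6-bit head layer, quantization scale overhead, activations, and framework buffers, so real usage will be somewhat higher.

```python
# Rough VRAM estimate for a 4.65 bpw quant with a 4-bit KV cache at 32K context.
# All constants below are assumptions for illustration, not values from the commit.
PARAMS = 72.7e9        # assumed parameter count of the base model
BPW = 4.65             # average bits per weight, from the note
N_LAYERS = 80          # assumed number of transformer layers
N_KV_HEADS = 8         # assumed number of key/value heads (grouped-query attention)
HEAD_DIM = 128         # assumed per-head dimension
CONTEXT = 32 * 1024    # 32K tokens, from the note
CACHE_BITS = 4         # Q4 cache, ignoring per-group scale overhead

GIB = 1024 ** 3


def weight_bytes(params: float, bpw: float) -> float:
    """Bytes needed to store the quantized weights."""
    return params * bpw / 8


def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   context: int, bits: int) -> float:
    """Bytes needed for the KV cache; the factor 2 covers keys and values."""
    return 2 * n_layers * n_kv_heads * head_dim * context * bits / 8


weights_gib = weight_bytes(PARAMS, BPW) / GIB
cache_gib = kv_cache_bytes(N_LAYERS, N_KV_HEADS, HEAD_DIM, CONTEXT, CACHE_BITS) / GIB
total_gib = weights_gib + cache_gib

print(f"weights: {weights_gib:.1f} GiB")
print(f"cache:   {cache_gib:.1f} GiB")
print(f"total:   {total_gib:.1f} GiB")
```

Under these assumptions the total stays below 48 GiB, which is consistent with the note, though the remaining headroom has to cover everything the estimate leaves out.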