Spaces:
Running
Running
Add arXiv link
Browse files
app.py
CHANGED
@@ -301,7 +301,7 @@ def grade(file_obj, progress=gr.Progress()):
|
|
301 |
model_result_example = "https://raw.githubusercontent.com/yuweihao/MM-Vet/main/v2/results/gpt-4o-2024-05-13_detail-high.json"
|
302 |
|
303 |
markdown = f"""
|
304 |
-
# [MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities](https://
|
305 |
|
306 |
We offer MM-Vet v2 LLM-based (GPT-4) evaluator to grade open-ended outputs from your models.
|
307 |
|
|
|
301 |
model_result_example = "https://raw.githubusercontent.com/yuweihao/MM-Vet/main/v2/results/gpt-4o-2024-05-13_detail-high.json"
|
302 |
|
303 |
markdown = f"""
|
304 |
+
# [MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities](https://arxiv.org/abs/2408.00765)
|
305 |
|
306 |
We offer MM-Vet v2 LLM-based (GPT-4) evaluator to grade open-ended outputs from your models.
|
307 |
|