Spaces:
Running
Running
Update Space (evaluate main: 05209ece)
Browse files
README.md
CHANGED
@@ -10,6 +10,10 @@ pinned: false
|
|
10 |
tags:
|
11 |
- evaluate
|
12 |
- metric
|
|
|
|
|
|
|
|
|
13 |
---
|
14 |
|
15 |
# Metric Card for Code Eval
|
|
|
10 |
tags:
|
11 |
- evaluate
|
12 |
- metric
|
13 |
+
description: >-
|
14 |
+
This metric implements the evaluation harness for the HumanEval problem solving dataset
|
15 |
+
described in the paper "Evaluating Large Language Models Trained on Code"
|
16 |
+
(https://arxiv.org/abs/2107.03374).
|
17 |
---
|
18 |
|
19 |
# Metric Card for Code Eval
|