sjrhuschlee
commited on
Commit
•
4a0e5e6
1
Parent(s):
c8b512d
Update README.md
Browse files
README.md
CHANGED
@@ -7,3 +7,31 @@ tags:
|
|
7 |
- deberta
|
8 |
- deberta-v3
|
9 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
7 |
- deberta
|
8 |
- deberta-v3
|
9 |
---
|
10 |
+
|
11 |
+
# deberta-v3-base for QA
|
12 |
+
|
13 |
+
This is the [deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) model, fine-tuned using the [SQuAD2.0](https://huggingface.co/datasets/squad_v2) dataset. It's been trained on question-answer pairs, including unanswerable questions, for the task of Question Answering.
|
14 |
+
|
15 |
+
|
16 |
+
## Overview
|
17 |
+
**Language model:** deberta-v3-base
|
18 |
+
**Language:** English
|
19 |
+
**Downstream-task:** Extractive QA
|
20 |
+
**Training data:** SQuAD 2.0
|
21 |
+
**Eval data:** SQuAD 2.0
|
22 |
+
**Code:** See [an example QA pipeline on Haystack](https://haystack.deepset.ai/tutorials/first-qa-system)
|
23 |
+
**Infrastructure**:
|
24 |
+
|
25 |
+
## Hyperparameters
|
26 |
+
|
27 |
+
```
|
28 |
+
batch_size = 12
|
29 |
+
n_epochs = 4
|
30 |
+
base_LM_model = "deberta-v3-base"
|
31 |
+
max_seq_len = 512
|
32 |
+
learning_rate = 2e-5
|
33 |
+
lr_schedule = LinearWarmup
|
34 |
+
warmup_proportion = 0.2
|
35 |
+
doc_stride=128
|
36 |
+
max_query_length=64
|
37 |
+
```
|