---
language: en
datasets:
- squad_v2
license: cc-by-4.0
tags:
- deberta
- deberta-v3
---

# deberta-v3-base for QA

This is the [deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) model, fine-tuned on the [SQuAD 2.0](https://huggingface.co/datasets/squad_v2) dataset for extractive question answering. The training data consists of question-answer pairs and includes questions that cannot be answered from the given context.
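
To illustrate what handling unanswerable questions means for an extractive QA model, here is a minimal sketch of a common SQuAD 2.0 style decoding step. It is not taken from this model's training or inference code. It assumes the model emits one start score and one end score per token, and that index 0 (the special first token) stands for "no answer". The function name, `max_answer_len`, and `null_threshold` are hypothetical, and real pipelines also restrict spans to context tokens, which is omitted here.

```python
from typing import List, Optional, Tuple


def best_answer_span(
    start_logits: List[float],
    end_logits: List[float],
    max_answer_len: int = 30,
    null_threshold: float = 0.0,
) -> Optional[Tuple[int, int]]:
    """Return (start, end) token indices of the best span, or None for "no answer".

    Assumption: index 0 is the special first token whose scores represent
    the "no answer" option.
    """
    null_score = start_logits[0] + end_logits[0]

    best_score = float("-inf")
    best_span = None
    for start in range(1, len(start_logits)):
        # Only consider spans with end >= start and at most max_answer_len tokens.
        last_end = min(start + max_answer_len, len(end_logits))
        for end in range(start, last_end):
            score = start_logits[start] + end_logits[end]
            if score > best_score:
                best_score = score
                best_span = (start, end)

    # Predict "no answer" when the null score beats the best span
    # by more than the threshold.
    if best_span is None or null_score - best_score > null_threshold:
        return None
    return best_span
```

A larger `null_threshold` makes the sketch more willing to return a span; a smaller one makes it abstain more often.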

## Overview

**Language model:** deberta-v3-base

**Language:** English

**Downstream task:** Extractive QA

**Training data:** SQuAD 2.0

**Eval data:** SQuAD 2.0

**Code:** See [an example QA pipeline on Haystack](https://haystack.deepset.ai/tutorials/first-qa-system)

**Infrastructure:**

## Hyperparameters

```
batch_size = 12
n_epochs = 4
base_LM_model = "deberta-v3-base"
max_seq_len = 512
learning_rate = 2e-5
lr_schedule = LinearWarmup
warmup_proportion = 0.2
doc_stride = 128
max_query_length = 64
```
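
As a rough illustration of how `learning_rate`, `lr_schedule`, and `warmup_proportion` interact, the sketch below computes the learning rate at a given optimizer step. It assumes the rate rises linearly from 0 to the peak over the first 20% of steps and then decays linearly to 0. The decay phase is an assumption, since the settings above only name the warmup. The function name and `total_steps` are hypothetical.

```python
def linear_warmup_lr(
    step: int,
    total_steps: int,
    peak_lr: float = 2e-5,
    warmup_proportion: float = 0.2,
) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay.

    Assumption: after warmup the rate decays linearly to 0 at `total_steps`.
    """
    warmup_steps = round(total_steps * warmup_proportion)
    if step < warmup_steps:
        # Warmup phase: scale linearly from 0 up to peak_lr.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase (assumed): scale linearly from peak_lr down to 0.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)
```

With a hypothetical `total_steps = 1000`, the peak of `2e-5` is reached at step 200, and the rate is back to half the peak at step 600.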
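
The `max_seq_len`, `doc_stride`, and `max_query_length` settings suggest that long contexts are split into overlapping windows. The sketch below shows one way such windows could be computed over token positions. It assumes that `doc_stride` is the step between consecutive window starts (as in the original BERT SQuAD script; some libraries instead define a stride as the overlap), and that each input carries three special tokens. The function name and `n_special_tokens` are hypothetical and not taken from this model's training code.

```python
from typing import List, Tuple


def context_windows(
    n_context_tokens: int,
    n_query_tokens: int,
    max_seq_len: int = 512,
    doc_stride: int = 128,
    max_query_length: int = 64,
    n_special_tokens: int = 3,  # assumption: [CLS] question [SEP] context [SEP]
) -> List[Tuple[int, int]]:
    """Return (start, end) context token ranges, end exclusive, one per window."""
    # The question is truncated to max_query_length tokens.
    n_query = min(n_query_tokens, max_query_length)
    # Remaining room for context tokens in each window.
    window = max_seq_len - n_query - n_special_tokens
    if window <= 0:
        raise ValueError("max_seq_len leaves no room for context tokens")

    spans = []
    start = 0
    while True:
        end = min(start + window, n_context_tokens)
        spans.append((start, end))
        if end == n_context_tokens:
            break
        # Assumption: doc_stride is the distance between window starts.
        start += min(window, doc_stride)
    return spans
```

Under these assumptions, a 1000-token context with a 10-token question yields five windows of up to 499 context tokens each, starting 128 tokens apart.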