metadata

license: apache-2.0
datasets:
  - HiTZ/casimedicos-squad
language:
  - en
  - es
  - fr
  - it
metrics:
  - f1
library_name: transformers
widget:
  - text: >-
      The answer is 2, an EMG, which would be the diagnostic test to confirm the
      diagnostic suspicion of ALS, the imaging tests both CT, but especially MRI
      would help us to rule out, but would not confirm the diagnosis.
    example_title: Example 1
  - text: >-
      Although other hemorrhagic diseases can have a prolonged thromboplastin
      time, due to the intensity of the lesion and the child's sex and family
      history, the most likely diagnosis is hemophilia.
    example_title: Example 2
  - text: >-
      Paradoxical pulse is a drop in blood pressure > 10 mmHg during
      inspiration;it represents an exaggeration of the physiological phenomenon
      consisting ofinspiratory lowering of BP (normal up to 10 mmHg). In cardiac
      tamponade,inspiration, which causes an increase in blood flow to the right
      chambers,increasing their volume, secondarily causes a displacement of
      theinterventricular septum to the left, so that the left heart lodges
      andexpels less blood during systole and the pulse, therefore, decreases.
      In anormal heart this exaggerated displacement, caused by the pressure
      exertedby the tamponade on the RV free wall, does not occur. Sinus X
      represents thesystolic collapse of the venous pulse, i.e., the pressure
      drop due to atrialrelaxation (also partly due to a downward displacement
      of the RV base duringsystole). Sinus Y represents the diastolic collapse
      of the venous pulse,i.e., the pressure drop that occurs from the moment
      blood enters thetricuspid valve into the ventricle. In cardiac tamponade,
      the deep sinus Xis characteristic. In constrictive pericarditis, the deep
      Y sinus. For allthese reasons, the correct answer is 5.
    example_title: Example 3
pipeline_tag: token-classification

mDeBERTa-base for Multilingual Correct Explanation Extraction in the Medical Domain

This model is a fine-tuned version of mdeberta-v3-base for a novel extractive task which consists of identifying the explanation of the correct answer written by medical doctors. The model has been fine-tuned using the multilingual https://huggingface.co/datasets/HiTZ/casimedicos-squad dataset, which includes English, French, Italian and Spanish.

Performance

The model scores 74.64 F1 partial match (as defined in SQuAD extractive QA task) averaged across the 4 languages.

tags: to delimit explanations of the correct answers and the rest at token level.
- 0: explanation of the correct answer
- 1: others

Fine-tuning hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 48
eval_batch_size: 8
seed: random
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 20.0

Framework versions

Transformers 4.30.0.dev0
Pytorch 2.1.2+cu121
Datasets 2.16.1
Tokenizers 0.15.2

Citation

If you use this model please cite the following paper:

@misc{goenaga2023explanatory,
      title={Explanatory Argument Extraction of Correct Answers in Resident Medical Exams}, 
      author={Iakes Goenaga and Aitziber Atutxa and Koldo Gojenola and Maite Oronoz and Rodrigo Agerri},
      year={2023},
      eprint={2312.00567},
      archivePrefix={arXiv}
}

Contact: Iakes Goenaga and Rodrigo Agerri HiTZ Center - Ixa, University of the Basque Country UPV/EHU