MohamedAhmedAE's picture
Update README.md
9c29277 verified
|
raw
history blame
713 Bytes
---
tags:
- medical
- mmlu
- medalpaca
- medmcqa
datasets:
- cais/mmlu
- medalpaca/medical_meadow_medqa
- medalpaca/medical_meadow_wikidoc
- openlifescienceai/medmcqa
- bigbio/med_qa
- GBaker/MedQA-USMLE-4-options
- medalpaca/medical_meadow_mmmlu
- medalpaca/medical_meadow_wikidoc_patient_information
- qiaojin/PubMedQA
pipeline_tag: text-generation
---
### Evaluation results
| Dataset | GPT-3.5 | Tuned Llama 3 |
|:-------------:|:-----:|:----:|
| MMLU Clinical Knowledge | 69.8| 74.34 |
| MMLU College Biology | 72.2| 72.92 |
| MMLU College Medicine | 61.3| 61.85 |
| MMLU Medical Genetics | 70.0| 76.0 |
| MMLU Professional Medicine| 70.2| 72.43 |
| MMLU Anatomy | 56.3| 61.48 |