|
--- |
|
tags: |
|
- medical |
|
- mmlu |
|
- medalpaca |
|
- medmcqa |
|
datasets: |
|
- cais/mmlu |
|
- medalpaca/medical_meadow_medqa |
|
- medalpaca/medical_meadow_wikidoc |
|
- openlifescienceai/medmcqa |
|
- bigbio/med_qa |
|
- GBaker/MedQA-USMLE-4-options |
|
- medalpaca/medical_meadow_mmmlu |
|
- medalpaca/medical_meadow_wikidoc_patient_information |
|
- qiaojin/PubMedQA |
|
pipeline_tag: text-generation |
|
--- |
|
### Evaluation results |
|
|
|
| Dataset | GPT-3.5 | Tuned Llama 3 | |
|
|:-------------:|:-----:|:----:| |
|
| MMLU Clinical Knowledge | 69.8| 74.34 | |
|
| MMLU College Biology | 72.2| 72.92 | |
|
| MMLU College Medicine | 61.3| 61.85 | |
|
| MMLU Medical Genetics | 70.0| 76.0 | |
|
| MMLU Professional Medicine| 70.2| 72.43 | |
|
| MMLU Anatomy | 56.3| 61.48 | |