Update README.md
results: []
---

# gemma-2b-lahacks 💻

This model is a fine-tuned version of [google/gemma-2b-it](https://huggingface.co/google/gemma-2b-it).
It achieves the following results on the evaluation set:
- Loss: 2.3061

## Model description 📝

This model was fine-tuned during LAHacks 2024. It is intended to diagnose a patient appropriately based on the information in their previous medical records, current symptoms, age, sex, and more.

## Intended uses & limitations ⁉️

Sample code snippet:

```py
```
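As a minimal sketch of how patient information might be formatted into a prompt for the model (the helper name and field labels here are illustrative assumptions; the card does not specify the actual prompt template used during fine-tuning):

```python
def build_patient_prompt(age, sex, symptoms, history):
    """Assemble patient information into a single diagnosis prompt.

    The field labels below are illustrative; the card does not
    document the exact prompt format used during fine-tuning.
    """
    lines = [
        f"Age: {age}",
        f"Sex: {sex}",
        "Symptoms: " + ", ".join(symptoms),
        "Medical history: " + ", ".join(history),
        "What is the most likely diagnosis?",
    ]
    return "\n".join(lines)


prompt = build_patient_prompt(34, "F", ["fever", "dry cough"], ["asthma"])
```

The resulting string would then be passed to the model's tokenizer and `generate` call in the usual way.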

Uses: To use artificial intelligence to diagnose a patient based on multiple parameters, ranging from their age to their medical record.

Limitations: There is a high likelihood that the model will NOT be great at diagnosing its users; the amount of time it took to fine-tune this model limited how much data we could train it on. With more time, a more accurate model would be expected.
37 |
|
38 |
+
## Training and evaluation data 📈
|
39 |
|
40 |
+
The model was trained on data from the research paper 'A New Dataset For Automatic Medical Diagnosis' by Arsène Fansi Tchango, Rishab Goel,
|
41 |
+
Zhi Wen, Julien Martel, Joumana Ghosn. The 'release_train_patients.csv' dataset was reduced from it's original 1.3 million rows of data to a
|
42 |
+
mere 500-1000 rows of data. This was due to the time it took to fine-tune a model which depended on how big the dataset provided was.
|
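The row reduction described above can be sketched as follows (a stdlib-only sketch; the function name and the random-sampling strategy are assumptions — the card does not say how the subset of rows was selected):

```python
import random


def subsample_rows(rows, n=1000, seed=42):
    """Keep a reproducible random subset of n rows (or all, if fewer)."""
    if len(rows) <= n:
        return list(rows)
    return random.Random(seed).sample(rows, n)


# Usage with the file named in the card (not loaded here):
# import csv
# with open("release_train_patients.csv", newline="") as f:
#     rows = list(csv.DictReader(f))          # ~1.3 million rows
# small = subsample_rows(rows, n=1000)        # hackathon-sized subset
```

Fixing the seed keeps the subset reproducible across the repeated fine-tuning attempts mentioned below.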
43 |
|
44 |
+
## Training procedure 🏋️
|
45 |
|
46 |
+
The fine-tuning took MULTIPLE, and I mean MULTIPLE tries. Sometimes the dataset provided was very big so the kernel had to be restarted multiple times.
|
47 |
+
Additionally, the model was tuned on the default data that Intel offers in their guide to fine-tune a gemma model.

### Training hyperparameters 🔍

The following hyperparameters were used during training:
- learning_rate: 1e-05