danielcthompson
/

Bio-ClinicalBERT_vascular_classification

Model card Files Files and versions Community

Daniel Thompson commited on Aug 23

Commit

fb340ba

•

1 Parent(s): 1e6e5e7

Update README.md

Files changed (1) hide show

README.md +14 -6

README.md CHANGED Viewed

@@ -84,22 +84,30 @@ If the length of the clinical text exceeds 512 tokens, you can use a sliding win
 You can view and run the full example on GitHub here:
 [Sliding Window Example Notebook](https://github.com/dannyt101/AAA_classification/blob/main/Stage_1/bio-clinicalBERT_vasc_class_demo.ipynb)
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: None
-- training_precision: float32
-### Training results
 ### Framework versions

 You can view and run the full example on GitHub here:
 [Sliding Window Example Notebook](https://github.com/dannyt101/AAA_classification/blob/main/Stage_1/bio-clinicalBERT_vasc_class_demo.ipynb)
 ## Training and evaluation data
+EHRs were downloaded from [MIMIC-IV clinical notes dataset](https://physionet.org/content/mimic-iv-note/2.2/)
+The EHRs were annotated by a Vascular Surgery Specialist Registrar/Resident and categorized as ‘Vascular’ if there was an acute pathology relevant to vascular surgery during their admission as per [National Health Service (NHS) England Service Specifications for Vascular Services](https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.england.nhs.uk/wp-content/uploads/2017/06/specialised-vascular-services-service-specification-adults.pdf&ved=2ahUKEwiknoKus4uIAxUFwAIHHaaQCBcQFnoECBMQAQ&usg=AOvVaw3yRyS-Ei1fiTNi6dcP8yOL).
 ## Training procedure
+The training was performed using TensorFlow's TPU strategy. Dataset was preprocessed using a sliding window approach to handle text longer than 512 tokens.
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- **Optimizer**: Adam
+- **Learning Rate**: 5e-5
+- **Batch Size**: 16
+- **Epochs**: Maximum of 5
+- **Early Stopping**: Triggered if validation loss did not improve for 2 consecutive epochs
+### Training Results
+The Bio-clinicalBERT model achieved the following results on the validation set:
+| Model              | Accuracy | Precision (Vascular) | Recall (Vascular) | F1-Score (Vascular) | Precision (Non-Vascular) | Recall (Non-Vascular) | F1-Score (Non-Vascular) |
+|--------------------|----------|----------------------|-------------------|---------------------|--------------------------|-----------------------|-------------------------|
+| **Bio-clinicalBERT** | 0.94     | 0.88                 | 0.70              | 0.78                | 0.95                     | 0.98                  | 0.96                    |
 ### Framework versions