anton-l's picture
anton-l HF staff
release
172e214
|
raw
history blame
558 Bytes
# Educational value classifier
### 1. Finetune a model for educational value regression
* edit `train_edu_bert.slurm`
```bash
--base_model_name="Snowflake/snowflake-arctic-embed-m" \ # BERT-like base model
--dataset_name="HuggingFaceTB/LLM_juries_fineweb_430k_annotations" \ # Llama3-annotated eduational value dataset
--target_column="score"
```
* run the training script on a SLURM cluster:
```bash
sbatch train_edu_bert.slurm
```
### 2. Annotate a dataset with the educational scores predicted by the model
```bash
sbatch run_edu_bert.slurm
```