nosnelmil
/

GPT2-CompareTransformers-Imdb

Text Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Edit model card

results

This model is a fine-tuned version of gpt2 on the imdb dataset. It achieves the following results on the evaluation set:

Loss: 0.2797
Accuracy: 0.9331
Precision: 0.9331
Recall: 0.9331
F1: 0.9331
Auroc: 0.9810

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 4
eval_batch_size: 4
seed: 42
distributed_type: multi-GPU
gradient_accumulation_steps: 4
total_train_batch_size: 16
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 3

Training results

Training Loss	Epoch	Step	Accuracy	Auroc	F1	Validation Loss	Precision	Recall
0.1436	0.46	500	0.8935	0.9751	0.8935	0.2923	0.8935	0.8935
0.1621	0.91	1000	0.9261	0.9789	0.9261	0.1984	0.9261	0.9261
0.2196	1.37	1500	0.9289	0.9810	0.9289	0.2082	0.9289	0.9289
0.1457	1.83	2000	0.9325	0.9816	0.9325	0.2282	0.9325	0.9325
0.1103	2.29	2500	0.9305	0.9806	0.9305	0.3201	0.9305	0.9305
0.0679	2.74	3000	0.2797	0.9331	0.9331	0.9331	0.9331	0.9810

Framework versions

Transformers 4.34.1
Pytorch 2.1.0+cu118
Datasets 2.14.6
Tokenizers 0.14.1

Downloads last month: 14

Inference Examples

Text Classification

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for nosnelmil/GPT2-CompareTransformers-Imdb

Base model

openai-community/gpt2

Finetuned

(1175)

this model

Dataset used to train nosnelmil/GPT2-CompareTransformers-Imdb

Evaluation results

Accuracy on imdb
self-reported

0.933
Precision on imdb
self-reported

0.933
Recall on imdb
self-reported

0.933
F1 on imdb
self-reported

0.933

View on Papers With Code