text-pic-request-identifier

This model is a fine-tuned version of distilbert/distilbert-base-uncased on a synthetic dataset.

It achieves the following results on the evaluation set:

  • Loss: 0.0015
  • Accuracy: 0.9996

Model description

The model classifies whether a user message is asking for a picture or for a text reply.

Intended uses & limitations

Intended for chat applications that need to route each incoming message to either a text model or an image model.

The model returns one of two labels: 'pic' or 'text'.
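
The routing described above could be sketched roughly as follows. This is a minimal sketch, not the author's implementation: `route_message`, `text_handler`, and `image_handler` are hypothetical names, and it assumes the classifier returns a list of dicts whose `label` field is exactly 'pic' or 'text'.

```python
from typing import Callable, Dict, List

# Assumed result shape of a text-classification pipeline call on one string:
# [{"label": "pic", "score": 0.99}]
Classifier = Callable[[str], List[Dict[str, object]]]


def route_message(
    message: str,
    classifier: Classifier,
    text_handler: Callable[[str], str],
    image_handler: Callable[[str], str],
) -> str:
    """Send the message to the image handler if it is labeled 'pic', else to the text handler."""
    label = classifier(message)[0]["label"]
    if label == "pic":
        return image_handler(message)
    # Anything other than 'pic' falls through to the text handler.
    return text_handler(message)
```

In a real application, `classifier` would be the loaded pipeline object and the two handlers would wrap whatever text and image models the application uses.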

Training and evaluation data

The model was trained on a synthetic dataset of about 25k messages. The messages were generated by several LLMs, including gpt4, gpt4o, gpt4o-mini, and gpt3.5-turbo.

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 2
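
For anyone trying to reproduce a similar run, the list above maps onto `transformers.TrainingArguments` roughly as shown here. This is a sketch under assumptions: the `output_dir` value is hypothetical, and mapping `train_batch_size` to `per_device_train_batch_size` assumes training on a single device without gradient accumulation, which the card does not state.

```python
# Keyword arguments for transformers.TrainingArguments mirroring the listed hyperparameters.
TRAINING_KWARGS = {
    "output_dir": "text-pic-request-identifier",  # hypothetical path, not from the card
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 16,  # assumes a single device
    "per_device_eval_batch_size": 16,
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 2,
}


def build_training_args():
    """Create TrainingArguments; imported lazily so the dict is usable on its own."""
    from transformers import TrainingArguments

    return TrainingArguments(**TRAINING_KWARGS)
```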

Training results

Training Loss | Epoch | Step | Validation Loss | Accuracy
0.0391        | 1.0   | 844  | 0.0021          | 0.9996
0.0021        | 2.0   | 1688 | 0.0015          | 0.9996
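
As a rough sanity check on the step counts, the number of training examples per epoch can be bounded from the steps per epoch and the batch size. This is a back-of-the-envelope sketch that assumes a single device, no gradient accumulation, and a final batch that may be partially filled; none of these is confirmed by the card.

```python
import math

steps_per_epoch = 844  # step count reported at the end of epoch 1
batch_size = 16        # train_batch_size from the hyperparameters

# Upper bound: every batch is full.
max_examples = steps_per_epoch * batch_size
# Lower bound: the last batch holds a single example.
min_examples = (steps_per_epoch - 1) * batch_size + 1

print(min_examples, max_examples)  # 13489 13504

# Both bounds produce exactly 844 steps per epoch.
assert math.ceil(max_examples / batch_size) == steps_per_epoch
assert math.ceil(min_examples / batch_size) == steps_per_epoch
```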

Framework versions

  • Transformers 4.44.0
  • PyTorch 2.3.1
  • Datasets 2.21.0
  • Tokenizers 0.19.1

How to use

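from transformers import pipeline

# Load the fine-tuned classifier from the Hugging Face Hub.
pic_classifier = pipeline("text-classification", model="andriadze/text-pic-request-identifier")

# Classify a single message; the result is a list with one dict holding 'label' and 'score'.
res = pic_classifier("Can you send me a selfie?")
print(res)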