pszemraj
/

tFINE-850m-24x24-instruct-L2

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on 29 days ago

Commit

1cc4203

•

1 Parent(s): 59876ca

Update README.md

Files changed (1) hide show

README.md +7 -34

README.md CHANGED Viewed

@@ -5,33 +5,21 @@ language:
 license: apache-2.0
 base_model: pszemraj/tFINE-850m-24x24-v0.5-instruct-L1
 tags:
-- generated_from_trainer
-model-index:
-- name: tFINE-850m-24x24-v0.5-instruct-L1-infinity-instruct-7m-T2T_en-1024-v2
-  results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# tFINE-850m-24x24-v0.5-instruct-L1-infinity-instruct-7m-T2T_en-1024-v2
-This model is a fine-tuned version of [pszemraj/tFINE-850m-24x24-v0.5-instruct-L1](https://huggingface.co/pszemraj/tFINE-850m-24x24-v0.5-instruct-L1) on the pszemraj/infinity-instruct-7m-T2T_en dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.2542
 - Num Input Tokens Seen: 750938410
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
@@ -50,18 +38,3 @@ No additional optimizer arguments
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 1.0
-### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Input Tokens Seen |
-|:-------------:|:------:|:----:|:---------------:|:-----------------:|
-| 1.32          | 0.2527 | 2000 | 1.3214          | 189801824         |
-| 1.2614        | 0.5053 | 4000 | 1.2815          | 379241088         |
-| 1.2367        | 0.7580 | 6000 | 1.2595          | 568955808         |
-### Framework versions
-- Transformers 4.46.0.dev0
-- Pytorch 2.5.1+cu124
-- Datasets 3.1.0
-- Tokenizers 0.20.1

 license: apache-2.0
 base_model: pszemraj/tFINE-850m-24x24-v0.5-instruct-L1
 tags:
+- instruct
+datasets:
+- pszemraj/infinity-instruct-7m-T2T_en
+pipeline_tag: text2text-generation
 ---
+# tFINE-850m-24x24-instruct-L2
+This model is a fine-tuned version of [pszemraj/tFINE-850m-24x24-v0.5-instruct-L1](https://huggingface.co/pszemraj/tFINE-850m-24x24-v0.5-instruct-L1) on the pszemraj/infinity-instruct-7m-T2T_en dataset (config `deduped-L2`).
 It achieves the following results on the evaluation set:
 - Loss: 1.2542
 - Num Input Tokens Seen: 750938410
 ## Training procedure
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 1.0