rshacter commited on
Commit
64a28f6
1 Parent(s): f73a58a

Update README.md

Browse files

his is an exercise for the Uplimit class: Finetuning LLMs
Take 1: Basic model minimal configurations

The basic notebook configuration,to ensure I could run the training and submit it.
Older LLLM EleutherAI/gpt-neo-1.3B
With:
max_steps=100
warmup_steps=10

orpo_config = ORPOConfig(
learning_rate=1e-5,
per_device_train_batch_size=4,
gradient_accumulation_steps=4,
max_steps=100,
warmup_steps=10,
gradient_checkpointing=True,
fp16=True,
logging_steps=10,
output_dir="./orpo_output",
optim="adamw_torch",
remove_unused_columns=False,
max_length=max_length,
max_prompt_length=512,
report_to="none",
)

Performance notes:

Files changed (1) hide show
  1. README.md +32 -2
README.md CHANGED
@@ -1,12 +1,42 @@
1
  ---
2
  library_name: transformers
3
- tags: []
 
 
 
 
 
4
  ---
5
 
 
6
  # Model Card for Model ID
7
 
8
  <!-- Provide a quick summary of what the model is/does. -->
9
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
 
11
 
12
  ## Model Details
 
1
  ---
2
  library_name: transformers
3
+ datasets:
4
+ - mlabonne/orpo-dpo-mix-40k
5
+ language:
6
+ - en
7
+ base_model:
8
+ - EleutherAI/gpt-neo-1.3B
9
  ---
10
 
11
+
12
  # Model Card for Model ID
13
 
14
  <!-- Provide a quick summary of what the model is/does. -->
15
+ This is an exercise for the Uplimit class: Finetuning LLMs
16
+ Take 1: Basic model minimal configurations
17
+
18
+ The basic notebook configuration,to ensure I could run the training and submit it.
19
+ Older LLLM EleutherAI/gpt-neo-1.3B
20
+ With:
21
+ max_steps=100
22
+ warmup_steps=10
23
+
24
+ orpo_config = ORPOConfig(
25
+ learning_rate=1e-5,
26
+ per_device_train_batch_size=4,
27
+ gradient_accumulation_steps=4,
28
+ max_steps=100,
29
+ warmup_steps=10,
30
+ gradient_checkpointing=True,
31
+ fp16=True,
32
+ logging_steps=10,
33
+ output_dir="./orpo_output",
34
+ optim="adamw_torch",
35
+ remove_unused_columns=False,
36
+ max_length=max_length,
37
+ max_prompt_length=512,
38
+ report_to="none",
39
+ )
40
 
41
 
42
  ## Model Details