Is there a checkpoint fine-tuned only on `ultrachat_200k`? We would like to use it for research on alignment algorithms.
#45 opened 2 months ago
by
AIR-hl
Interview request: genAI evaluation & documentation
#44 opened 2 months ago
by
evatang
Adding Evaluation Results
#42 opened 3 months ago
by
leaderboard-pr-bot
How to remove input tokens to get only output tokens?
#41 opened 4 months ago
by
ducknificient
Multilingual model
#40 opened 4 months ago
by
ducknificient
Instruction Tuning Model
2
#39 opened 4 months ago
by
ducknificient
Request: DOI
#38 opened 4 months ago
by
climbingm
Adding Evaluation Results
#37 opened 5 months ago
by
leaderboard-pr-bot
CUDA assertion error when trying to train
#36 opened 5 months ago
by
brianwilcken
Can you upload the SFT version as well?
#34 opened 6 months ago
by
jiwan-chung
Adding Evaluation Results
#33 opened 6 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#32 opened 7 months ago
by
asck
Adding Evaluation Results
#31 opened 7 months ago
by
leaderboard-pr-bot
It wrote a credible new recipe for spiced frog salad
#30 opened 8 months ago
by
MartialTerran
Write a story....
#29 opened 8 months ago
by
MartialTerran
Too many junk vocab words in the vocab.json.
8
#28 opened 8 months ago
by
MartialTerran
Bing (ChatGPT4) analyzes the "def fibonacci_sequence_to_digits(n)" example code.
#27 opened 8 months ago
by
MartialTerran
Update widget example
#26 opened 9 months ago
by
Xenova
Adding Evaluation Results
#25 opened 9 months ago
by
leaderboard-pr-bot
Deployment?
3
#24 opened 9 months ago
by
huggingface9837
[AUTOMATED] Model Memory Requirements
#22 opened 9 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#21 opened 9 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#20 opened 9 months ago
by
model-sizer-bot
Dataset for DPO, with a Template?
1
#17 opened 10 months ago
by
ewqr2130
Prompt format?
4
#16 opened 10 months ago
by
anuragrawal
Minimum supported device?
2
#15 opened 11 months ago
by
sachinmyneni
Transformers unable to load the model
#14 opened 11 months ago
by
iammayur
BFloat16 is not supported on MPS
8
#13 opened 11 months ago
by
nhannn
ImportError: cannot import name 'LlamaTokenizer' from 'transformers' (/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/site-packages/transformers/__init__.py)
1
#12 opened 11 months ago
by
gmdl007
Training on corpus of text (astronomy) - without templates
1
#11 opened 11 months ago
by
demetera
What are the use cases? It is deranged like Joe Biden
2
#10 opened 11 months ago
by
froilo
What is the context size?
1
#9 opened 11 months ago
by
streamerbtw1002
Is it on the leaderboard?
3
#8 opened 11 months ago
by
AIWintermuteAI
You know what we are going to ask
1
#6 opened 11 months ago
by
LaferriereJC
Fine Tuning
3
#5 opened 11 months ago
by
ybsid
You should try training a model with 2B parameters and context length 32000.
1
#3 opened 11 months ago
by
win10
Fantastic work guys!
2
#1 opened 11 months ago
by
dillfrescott