hai
cloudyu
AI & ML interests
Personal contributor
m2 ultra 192G
QQ 206 887 187
Organizations
cloudyu's activity
Adding Evaluation Results
#16 opened 15 days ago
by
leaderboard-pr-bot
不知道下载哪些内容
1
#18 opened 2 months ago
by
qcnace
welcome feedback
#1 opened 25 days ago
by
cloudyu
Adding Evaluation Results
#1 opened 7 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#7 opened 7 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#3 opened 7 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
1
#4 opened 7 months ago
by
ac-automata
Adding Evaluation Results
#5 opened about 2 months ago
by
leaderboard-pr-bot
Adding Evaluation Results
#1 opened 2 months ago
by
leaderboard-pr-bot
mistral-chat doesn't work
6
#12 opened 3 months ago
by
cloudyu
mamba bug
5
#4 opened 3 months ago
by
cloudyu
How can I use this great model in python script?
1
#1 opened 3 months ago
by
cloudyu
example code doesn't work at all
7
#2 opened 3 months ago
by
cloudyu
how to use these great models in python scripts?
1
#1 opened 3 months ago
by
cloudyu
Update README.md with license information
#4 opened 3 months ago
by
Chen-01AI
Update README.md with license information
#15 opened 3 months ago
by
Chen-01AI
Update README.md with license information
#6 opened 3 months ago
by
Chen-01AI
Update README.md with license information
#4 opened 3 months ago
by
Chen-01AI
Update README.md with license information
#2 opened 3 months ago
by
Chen-01AI
Update README.md with license information
#17 opened 3 months ago
by
Chen-01AI
Reconvert GGUF for the MoE, due to llama.cpp update
1
#1 opened 5 months ago
by
CombinHorizon
good model
1
#1 opened 5 months ago
by
gopi87
"status" is "FINISHED", but I cannot find the result of my model
1
#694 opened 6 months ago
by
cloudyu
output is not correct.
1
#7 opened 6 months ago
by
flymonk
can you plesse share how to make this version?
3
#3 opened 6 months ago
by
cloudyu
How much RAM does it need to run on Mac m1?
5
#2 opened 6 months ago
by
davideuler
MMLU is only 25.64, anything wrong?
5
#8 opened 6 months ago
by
cloudyu
VRAM Estimates
6
#3 opened 7 months ago
by
ernestr
From your work, I find a new way to do model ensemble
1
#14 opened 6 months ago
by
xxx1
Hardware requirement
2
#5 opened 7 months ago
by
Dtree07
4x version
1
#15 opened 7 months ago
by
ehartford
Very interesting
1
#1 opened 7 months ago
by
ehartford
Thank you for your continued contribution to Chinese-language community|感谢你对中文社区的持续贡献
1
#1 opened 7 months ago
by
sdakfjlkfasf
Why did you take down gemma-7b-it-dpo-v1
1
#2 opened 7 months ago
by
rombodawg
how to run this model?
3
#1 opened 7 months ago
by
cloudyu
Adding Evaluation Results
1
#1 opened 7 months ago
by
leaderboard-pr-bot
这个是基于中文的微调吗 效果这么好
1
#2 opened 8 months ago
by
xuan0126
Train after merging?
2
#1 opened 8 months ago
by
adi-kmt
how to run this model
#2 opened 8 months ago
by
cloudyu
Upload tokenizer.model
1
#1 opened 8 months ago
by
Nexesenex
Upload tokenizer.model
#2 opened 8 months ago
by
Nexesenex
how to dequantised from q5 to f16?
#7 opened 8 months ago
by
cloudyu
Update README.md
#2 opened 8 months ago
by
cloudyu
Update README.md
#1 opened 8 months ago
by
cloudyu
Can you make a 2.4bpw exl2 quantisation for this model?
4
#1 opened 8 months ago
by
xldistance
How did you train the gating?
10
#6 opened 9 months ago
by
osanseviero
Unable to access cloudyu/Pluto_24B_DPO_400
1
#1 opened 8 months ago
by
umarbutler
this is really great dataset
1
#2 opened 9 months ago
by
cloudyu
Could you share the training script?
1
#1 opened 9 months ago
by
andysalerno
Announcement: Flagging merged models with incorrect metadata
82
#510 opened 9 months ago
by
clefourrier
congrat!new SOTA!
4
#1 opened 9 months ago
by
cloudyu
How many GPU memories that the MoE module needs?
2
#8 opened 9 months ago
by
Jazzlee
The function_calling and translation abilities are weaker than Mixtral 8x7b
1
#11 opened 9 months ago
by
bingw5
Multi-langua?
1
#7 opened 9 months ago
by
oFDz
8.0bpw-h8-exl2 quant of this model
6
#1 opened 9 months ago
by
Light4Bear