Burkov
Andriy
AI & ML interests
None yet
Organizations
None yet
Andriy's activity
Issues with FSDP and DeepSpeed During Distributed Training for Gemma
5
#30 opened 4 months ago
by
anandhperumal
Why a separate release?
#1 opened 4 months ago
by
Andriy
add_special_tokens=True doesn't add eos token at the end of the sequence
1
#4 opened 4 months ago
by
Andriy
Where is the model? 0 downloads means nobody can use it. Please fix.
10
#1 opened 6 months ago
by
Andriy
How does v0.2 manages to support 32k token context without Sliding Window Attention?
4
#85 opened 7 months ago
by
Andriy
What is the max. content length of Mistral-7B-Instruct-v0.2?
17
#43 opened 9 months ago
by
hanshupe
Longer inference time
2
#4 opened 7 months ago
by
dittops
Finetuning dataset
#35 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#4 opened 7 months ago
by
Andriy
Finetuning dataset
#2 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#1 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#3 opened 7 months ago
by
Andriy
instruct-finetuning dataset
1
#2 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#2 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#5 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#3 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#1 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#2 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
1
#1 opened 7 months ago
by
Andriy
Instruct-finetuning dataset
#4 opened 7 months ago
by
Andriy