Hugging Face
Dmytro Dzhulgakov
dzhulgakov
2 followers · 0 following
AI & ML interests
None yet
Recent Activity
New activity
about 2 months ago
meta-llama/Llama-3.2-11B-Vision-Instruct
New activity
about 2 months ago
meta-llama/Llama-3.2-1B-Instruct
Organizations
dzhulgakov's activity
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
about 2 months ago
Tokenizer needs to be fixed for BOS handling
#18 opened about 2 months ago by
dzhulgakov
New activity in
meta-llama/Llama-3.2-1B-Instruct
about 2 months ago
Tokenizer BOS behavior is inconsistent with Llama 3.1
1
#5 opened about 2 months ago by
dzhulgakov
New activity in
deepseek-ai/DeepSeek-Coder-V2-Instruct
5 months ago
How important is the grouped_topk?
#6 opened 5 months ago by
dzhulgakov
New activity in
google/gemma-2-9b
5 months ago
Can't repro MMLU: sliding window attention implementation seems broken
3
#11 opened 5 months ago by
dzhulgakov
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
7 months ago
clean_up_tokenization_spaces=True causes formatting issues, why is it set?
#44 opened 7 months ago by
dzhulgakov
New activity in
google/gemma-7b-it
9 months ago
Running sample code gives me a shape error
1
#22 opened 9 months ago by
dzhulgakov
New activity in
DiscoResearch/mixtral-7b-8expert
12 months ago
Update modeling_moe_mistral.py
2
#1 opened 12 months ago by
bjoernp
commented on a paper about 1 year ago
Mistral 7B
Paper • 2310.06825 • Published Oct 10, 2023 • 47 • 8