Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
144.5
TFLOPS
672
15
190
Arthur Zucker
ArthurZ
Follow
krishKarnan1's profile picture
Aurelien-Morgan's profile picture
AyoubIgh's profile picture
292 followers
Ā·
17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Recent Activity
reacted to
Xenova
's
post
with š„
about 17 hours ago
reacted to
davidberenstein1957
's
post
with š
about 17 hours ago
reacted to
LukeNeumann
's
post
with š¤Æ
about 17 hours ago
View all activity
Articles
Fixing Gradient Accumulation
Oct 16
ā¢
41
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Aug 21
ā¢
22
Fine-Tuning Gemma Models in Hugging Face
Feb 23
ā¢
23
Code Llama: Llama 2 learns to code
Aug 25, 2023
ā¢
8
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mistralai/Pixtral-Large-Instruct-2411
2 days ago
Upload transformers version
3
#3 opened 3 days ago by
ArthurZ
New activity in
huggingface/documentation-images
6 days ago
Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened 6 days ago by
kwen2501
New activity in
mistral-community/pixtral-12b
about 1 month ago
Update model weight
8
#13 opened about 1 month ago by
nguyen-brat
Update hidden_act to silu
2
#14 opened about 1 month ago by
ArthurZ
New activity in
rhymes-ai/Aria
about 1 month ago
llama.cpp support
9
#1 opened about 1 month ago by
ayyylol
New activity in
google/gemma-2-2b-jpn-it
about 2 months ago
tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened about 2 months ago by
dahara1
New activity in
mistral-community/pixtral-12b
about 2 months ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened about 2 months ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
about 2 months ago
hidden_activation vs hidden_act in config.json
2
#10 opened 2 months ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
about 2 months ago
How to use safetensors?
2
#13 opened 2 months ago by
prathi1729
New activity in
mistral-community/pixtral-12b
2 months ago
lamma cpp ht to gguf not working
4
#2 opened 2 months ago by
RameshRajamani
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
3 months ago
8-kv-heads
8
#14 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
Update config.json
#17 opened 3 months ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened 3 months ago by
tanmaylaud
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
3 months ago
8 kv heads
2
#13 opened 4 months ago by
kkokkie2360
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
8-kv-heads
#15 opened 4 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B
3 months ago
8-kv-heads
3
#21 opened 4 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct
3 months ago
8-kv-heads
4
#17 opened 4 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
4 months ago
Updated eos_token to include multiple IDs
1
#14 opened 4 months ago by
vontimitta
Update tokenizer to prepend special token
#12 opened 4 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-70B
4 months ago
Update tokenizer to prepend special token
1
#11 opened 4 months ago by
lysandre
Load more