Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
144.5
TFLOPS
630
11
144
Arthur Zucker
ArthurZ
Follow
ravikumarmn's profile picture
qqlzfmn's profile picture
MKSH's profile picture
216 followers
·
14 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Fine-Tuning Gemma Models in Hugging Face
Feb 23
•
13
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
2
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
google/gemma-2-27b-it
2 days ago
Model repeating information and "spitting out" random characters
1
#12 opened 3 days ago by
brazilianslib
New activity in
google/gemma-2-27b-it
3 days ago
Hallucinations, misspellings etc. Something seems broken?
16
#10 opened 3 days ago by
sam-paech
transformers load fails?
7
#6 opened 4 days ago by
bdambrosio
New activity in
google/gemma-2-9b
5 days ago
Runtime autograd error due to inplace operations
1
#4 opened 5 days ago by
xianbin
New activity in
microsoft/Florence-2-large
5 days ago
Please add to llama.cpp and ollama
2
#21 opened 9 days ago by
KeilahElla
New activity in
meta-llama/Meta-Llama-3-8B
28 days ago
Why are "add_bos_token" and "add_eos_token" missing in tokenizer_config.json ?
1
#140 opened about 2 months ago by
ekurtic
New activity in
mistralai/Mistral-7B-Instruct-v0.3
about 1 month ago
Slow tokenizer problem.
4
#22 opened about 1 month ago by
bradhutchings
New activity in
meta-llama/Meta-Llama-3-8B
about 1 month ago
LlamaTokenizerFast.from_pretrained gives incorrect number of tokens for Llama3
2
#156 opened about 1 month ago by
farzadab
New activity in
mistralai/Mistral-7B-Instruct-v0.3
about 1 month ago
Add minor reference to transformers
4
#7 opened about 1 month ago by
osanseviero
Upload tokenizer
#6 opened about 1 month ago by
ArthurZ
Upload tokenizer
#5 opened about 1 month ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
about 1 month ago
Update README.md
#4 opened about 1 month ago by
ArthurZ
Update README.md
#3 opened about 1 month ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
about 1 month ago
Update README.md
#4 opened about 1 month ago by
ArthurZ
Update config.json
1
#3 opened about 1 month ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
about 1 month ago
Upload MistralForCausalLM
#2 opened about 1 month ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
about 1 month ago
Upload MistralForCausalLM
#2 opened about 1 month ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
about 1 month ago
Upload tokenizer
1
#1 opened about 1 month ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
about 1 month ago
Upload tokenizer
#1 opened about 1 month ago by
ArthurZ
New activity in
01-ai/Yi-9B
about 2 months ago
Tokenizer inconsistencies related to HTML tags
4
#11 opened 3 months ago by
sanderland
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
about 2 months ago
Update config.json
1
#105 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
about 2 months ago
Update config.json
3
#49 opened about 2 months ago by
ArthurZ
The sample code for usage with Transformers is incorrect.
2
#45 opened about 2 months ago by
endNone
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
about 2 months ago
How to use EOT_ID
4
#54 opened 2 months ago by
saksham-lamini
New activity in
meta-llama/Meta-Llama-3-8B
about 2 months ago
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
9
#72 opened 2 months ago by
tianke0711
Unable to load the model for Torch versions starting from 2.0.1
8
#34 opened 2 months ago by
benhachem
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
about 2 months ago
Update config.json
4
#33 opened 2 months ago by
ArthurZ
Update README.md
1
#31 opened 2 months ago by
shokim
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
about 2 months ago
Update tokenizer_config.json
16
#60 opened 2 months ago by
Navanit-shorthills
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
2 months ago
Update config.json
1
#71 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B
2 months ago
Update generation_config.json
#10 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
2 months ago
Update generation_config.json
#30 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B
2 months ago
Update generation_config.json
1
#68 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
2 months ago
Update generation_config.json
1
#62 opened 2 months ago by
ArthurZ
Update generation_config.json
#61 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B
2 months ago
Update generation_config.json
#67 opened 2 months ago by
ArthurZ
Generated text is garbled?
5
#53 opened 2 months ago by
gbhall
is there a chat model? or i need to use specific instruction
2
#63 opened 2 months ago by
Barianc
Llama-3-8B not giving the entire outcome in Google Colab
2
#55 opened 2 months ago by
sayanroy07
how to download llama3
1
#58 opened 2 months ago by
pacopascal
The model repeats the question/answer multiple times in the output
4
#60 opened 2 months ago by
ameljelidi
Issues with tokenizer causing bad performance of model.
2
#66 opened 2 months ago by
Takuonline
Hi, I try to load with LlamaForCausalLM, LlamaTokenizer, but it show me the error that "not a string"
7
#64 opened 2 months ago by
hjewr
New activity in
TRI-ML/mamba-7b-rw
2 months ago
Adding `safetensors` variant of this model
3
#4 opened 2 months ago by
lucataco
New activity in
google/recurrentgemma-2b-it
2 months ago
Fix tokenizer
#11 opened 3 months ago by
pcuenq
New activity in
google/recurrentgemma-2b
2 months ago
Fix tokenizer
#6 opened 3 months ago by
pcuenq
New activity in
google/recurrentgemma-2b-it
2 months ago
ValueError: The device_map provided does not give any device for the following parameters: model.normalizer
9
#8 opened 3 months ago by
LaferriereJC
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
2 months ago
Tokenizer mismatch all the time
2
#47 opened 2 months ago by
tian9
New activity in
meta-llama/Meta-Llama-3-8B
2 months ago
Update tokenizer_config.json to prepend the bos token
7
#35 opened 2 months ago by
eduagarcia
Rotary position embeddings not loaded
1
#39 opened 2 months ago by
cwbc
Rename original/tokenizer.model to tokenizer.model
3
#6 opened 2 months ago by
winglian
New activity in
google/recurrentgemma-2b-it
3 months ago
ValueError when use multiple GPUs for inference
2
#10 opened 3 months ago by
aladinggit
New activity in
google/gemma-1.1-7b-it
3 months ago
Fix slow tokenizer
2
#14 opened 3 months ago by
pcuenq
New activity in
google/recurrentgemma-2b-it
3 months ago
I can't load this model on L4 GPU
2
#5 opened 3 months ago by
albusdd
New activity in
google/gemma-1.1-7b-it-GGUF
3 months ago
Add quantized GGUFs?
1
#2 opened 3 months ago by
MoonRide
New activity in
hf-internal-testing/tiny-random-gpt2
3 months ago
Adding `safetensors` variant of this model
#2 opened 5 months ago by
SFconvertbot
New activity in
ai21labs/Jamba-v0.1
3 months ago
Fix bias logic to enable QLoRA finetuning
3
#5 opened 3 months ago by
winglian
New activity in
llava-hf/llava-v1.6-mistral-7b-hf
3 months ago
wrong padding token
2
#2 opened 4 months ago by
aliencaocao
New activity in
hpcai-tech/grok-1
3 months ago
Upload tokenizer
7
#4 opened 3 months ago by
ArthurZ
New activity in
CohereForAI/c4ai-command-r-v01
3 months ago
Update README.md
1
#34 opened 3 months ago by
ArthurZ
Load more