Marc Sun
marcsun13
89 followers · 129 following
_marcsun
SunMarc
AI & ML interests
LLM, Quantization, Training, Inference
Articles
Fine-tuning LLMs to 1.58bit: extreme quantization made easy • 15 days ago • 139
Accelerate 1.0.0 • 20 days ago • 34
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context • Jul 23 • 197
quanto: a pytorch quantization toolkit • Mar 18 • 28
Overview of natively supported quantization schemes in 🤗 Transformers • Sep 12, 2023 • 10
Making LLMs lighter with AutoGPTQ and transformers • Aug 23, 2023 • 28
marcsun13's activity
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
2 months ago
Upload folder using huggingface_hub
2
#4 opened 2 months ago by
marcsun13
New activity in
meta-llama/Llama-3.1-405B-FP8
2 months ago
Upload folder using huggingface_hub
2
#8 opened 2 months ago by
marcsun13
New activity in
meta-llama/Llama-3.1-405B-Instruct
2 months ago
Update original/mp8/README.md
#2 opened 2 months ago by
marcsun13
Update original/mp16/README.md
#1 opened 2 months ago by
marcsun13
New activity in
meta-llama/Llama-3.1-405B
2 months ago
Update original/mp16/README.md
#5 opened 2 months ago by
marcsun13
Update original/mp8/README.md
#4 opened 2 months ago by
marcsun13
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
2 months ago
Upload folder using huggingface_hub
2
#2 opened 2 months ago by
marcsun13
New activity in
meta-llama/Llama-3.1-405B-FP8
2 months ago
Upload folder using huggingface_hub
2
#7 opened 2 months ago by
marcsun13
[WIP] Upload folder using huggingface_hub (multi-commit 015597a9a84fd3a9cd8c9844ceb2b85ce89bb1a387968fd94159cb19e4200044)
#6 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#4 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#3 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#2 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#1 opened 2 months ago by
marcsun13
New activity in
google/flan-t5-xxl
6 months ago
ValueError: Need either a `state_dict` or a `save_folder` containing offloaded weights.
5
#53 opened about 1 year ago by
tuannguyends
New activity in
huggingface/documentation-images
7 months ago
Upload NousResearch-Llama-2-7b-hf_Perplexity.png
1
#292 opened 7 months ago by
marcsun13
Upload NousResearch-Llama-2-7b-hf_Perplexity.png
#291 opened 7 months ago by
marcsun13
New activity in
mlx-community/Llama-2-7b-chat-4-bit
10 months ago
Update README.md
1
#4 opened 10 months ago by
marcsun13
Update README.md
#3 opened 10 months ago by
marcsun13
New activity in
mlx-community/Mistral-7B-Instruct-v0.2-4-bit
10 months ago
Update README.md
#5 opened 10 months ago by
marcsun13
Update config.json
1
#4 opened 10 months ago by
marcsun13
New activity in
mistralai/Mixtral-8x7B-Instruct-v0.1
10 months ago
Intuition for quality decrease after quantization
4
#23 opened 10 months ago by
krumeto
New activity in
mistralai/Mistral-7B-v0.1
11 months ago
Adding `safetensors` variant of this model
2
#91 opened 11 months ago by
lcahill
New activity in
hf-accelerate/model-memory-usage
11 months ago
Llama-2 models don't work since they have auth token required. I have an auth token but it is not displaying
7
#16 opened about 1 year ago by
sayambhu
Determining Minimum GPU Memory and Input Text Length Calculation in Model Training
2
#19 opened 12 months ago by
kobe8-24
New activity in
mistralai/Mistral-7B-v0.1
11 months ago
Does Mistral support accelerate library?
4
#65 opened 12 months ago by
Sp1der
New activity in
marcsun13/Llama-2-13B-AWQ
11 months ago
Update config.json
#1 opened 11 months ago by
ybelkada
New activity in
marcsun13/opt-125m-awq
11 months ago
Update config.json
#3 opened 11 months ago by
ybelkada
Update config.json
#2 opened 11 months ago by
ybelkada
Update config.json
#1 opened 11 months ago by
ybelkada
New activity in
huggingface/documentation-images
about 1 year ago
Upload A100_use_cache_True.jpg
#181 opened about 1 year ago by
marcsun13
add images to 163_overview-quantization-transformers
1
#180 opened about 1 year ago by
marcsun13
add overview-quantization-transformers blog images
#179 opened about 1 year ago by
marcsun13
add images for overview-quantization-transformers blog
1
#178 opened about 1 year ago by
marcsun13
overview-quantization-transformers blog images
#177 opened about 1 year ago by
marcsun13
add images to overview-quantization-transformers folder
#176 opened about 1 year ago by
marcsun13
New activity in
hf-accelerate/model-memory-usage
about 1 year ago
Add link to the access token
1
#5 opened about 1 year ago by
marcsun13
Model Memory Consumption of Llama-2 models, access granted
8
#4 opened about 1 year ago by
arkoi