Alexandre Marques's picture

17 6

Alexandre Marques

alexmarques

·

anmarques

AI & ML interests

None yet

Recent Activity

updated a collection 3 days ago

Llama-3.1 Quantization

updated a dataset 3 days ago

neuralmagic/Inference_performance_Llama_3.1_vllm0.6.1.post2

updated a model 4 days ago

neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16

View all activity

Organizations

alexmarques's activity

upvoted a collection 4 days ago

Sparse-Llama-3.1-2of4

2:4 sparse versions of Llama-3.1, including transfer learning • 7 items • Updated 4 days ago • 1

upvoted 2 papers 20 days ago

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6 • 7

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 21 days ago • 44

upvoted a collection 2 months ago

Llama-3.2 Quantization

Llama 3.2 models quantized by Neural Magic • 9 items • Updated Sep 26 • 9

upvoted 2 collections 4 months ago

Llama-3.1 Quantization

Neural Magic quantized Llama-3.1 models • 22 items • Updated 3 days ago • 39

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26 • 10