YangWang92's picture

YangWang92

yangwang92

·

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8

liked a model 4 days ago

meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8

liked a model 4 days ago

mistralai/Pixtral-Large-Instruct-2411

View all activity

Organizations

yangwang92's activity

upvoted a collection 7 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 5 days ago • 229

upvoted a paper 15 days ago

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published 16 days ago • 63

upvoted a paper 17 days ago

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2 • 12

upvoted a paper 21 days ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 50

upvoted a paper 26 days ago

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

Paper • 2410.18252 • Published about 1 month ago • 5

upvoted a paper about 1 month ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16 • 39

upvoted 3 papers about 2 months ago

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Paper • 2409.20566 • Published Sep 30 • 52

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models

Paper • 2409.17066 • Published Sep 25 • 27

Phantom of Latent for Large Language and Vision Models

Paper • 2409.14713 • Published Sep 23 • 27

upvoted a collection 2 months ago

GSA

3 items • Updated 13 days ago • 2

upvoted an article 3 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

By

•

Sep 2

• 18

upvoted a collection 3 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Sep 18 • 157

upvoted a paper 3 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16 • 97

upvoted a collection 4 months ago

Chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9 • 27

upvoted a paper 4 months ago

Q-Sparse: All Large Language Models can be Fully Sparsely-Activated

Paper • 2407.10969 • Published Jul 15 • 20

upvoted 2 papers 6 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26 • 22

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16 • 126

upvoted a paper 9 months ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8 • 60