Rui-Jie Zhu

ridger

ridger's activity

upvoted a collection 9 days ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 294

upvoted a paper 13 days ago

Gated Linear Attention Transformers with Hardware-Efficient Training

Paper • 2312.06635 • Published Dec 11, 2023 • 6

upvoted 3 papers 19 days ago

Autonomous Driving with Spiking Neural Networks

Paper • 2405.19687 • Published May 30 • 1

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Paper • 2302.13939 • Published Feb 27, 2023 • 1

Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4 • 10

upvoted a paper about 1 month ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21 • 27

upvoted a paper about 2 months ago

Gated Slot Attention for Efficient Linear-Time Sequence Modeling

Paper • 2409.07146 • Published Sep 11 • 19

upvoted a collection 5 months ago

MatMulfree LM

Pre-trained models for MatMulfree LM. • 4 items • Updated Jun 10 • 25

upvoted 2 papers 7 months ago

HGRN2: Gated Linear RNNs with State Expansion

Paper • 2404.07904 • Published Apr 11 • 17

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Paper • 2404.05014 • Published Apr 7 • 53

upvoted 2 papers about 1 year ago

Scaling Data-Constrained Language Models

Paper • 2305.16264 • Published May 25, 2023 • 17

Neurons in Large Language Models: Dead, N-gram, Positional

Paper • 2309.04827 • Published Sep 9, 2023 • 16