MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Paper • 2409.07314 • Published Sep 11, 2024 • 50
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 602
BitNet: Scaling 1-bit Transformers for Large Language Models Paper • 2310.11453 • Published Oct 17, 2023 • 96
RoFormer: Enhanced Transformer with Rotary Position Embedding Paper • 2104.09864 • Published Apr 20, 2021 • 10
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 111
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Paper • 2205.14135 • Published May 27, 2022 • 11
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 251
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning Paper • 2307.08691 • Published Jul 17, 2023 • 8
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 24