Devin Gonier's picture

9 127

Devin Gonier

dgonier

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

allenai/OLMo-2-1124-13B

liked a model 3 days ago

Qwen/QwQ-32B-Preview

liked a model 5 days ago

OuteAI/OuteTTS-0.2-500M-GGUF

View all activity

Organizations

dgonier's activity

upvoted a collection 3 months ago

Jamba-1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22 • 82

upvoted a paper 5 months ago

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1 • 39

upvoted 2 collections 5 months ago

models to evaluate

collecting models I want to evaluate on shadereval-task2: https://github.com/bigcode-project/bigcode-evaluation-harness/pull/173 at fp16!! • 39 items • Updated 13 days ago • 2

Code Evaluation

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29 • 14

upvoted a collection 8 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22 • 24

upvoted a paper 11 months ago

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11 • 43

upvoted 2 papers about 1 year ago

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 18

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

upvoted a paper over 1 year ago

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170