alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN...

Recent Activity

updated a model about 2 hours ago

AtAndDev/marco-qwq-7B

Reacted to prithivMLmods's post with 🤗 about 14 hours ago

Milestone for Flux.1 Dev 🔥

💢 The Flux.1 Dev model has crossed 1️⃣0️⃣,0️⃣0️⃣0️⃣ creative public adapters! 🎈
🔗 https://huggingface.co/models?other=base_model:adapter:black-forest-labs/FLUX.1-dev

💢 This includes:
- 266 Finetunes
- 19 Quants
- 4 Merges

💢 Here’s the 10,000th public adapter: 😜 https://huggingface.co/strangerzonehf/Flux-3DXL-Partfile-0006
💢 Page: https://huggingface.co/strangerzonehf
💢 Collection: https://huggingface.co/collections/prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

Reacted to burtenshaw's post with ❤️ about 15 hours ago

For anyone looking to boost their LLM fine-tuning and alignment skills this December: we're running a free and open course called smol course. It’s not big like Li Yin and @mlabonne, it’s just smol. 👷 It focuses on practical use cases, so if you’re working on something, bring it along. 👯‍♀️ It’s peer reviewed and open so you can discuss and get feedback. 🤘 If you’re already a smol pro, feel free to drop a star or issue. Part 1 starts now, and it’s on instruction tuning! https://github.com/huggingface/smol-course

Organizations

AtAndDev's activity

upvoted a paper 3 days ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published 9 days ago • 35

upvoted 2 collections 6 days ago

🧠 Reasoning Models

5 items • Updated about 24 hours ago • 24

QwQ

Qwen with Questions • 2 items • Updated 6 days ago • 33

upvoted 4 papers 10 days ago

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 74

V-STaR: Training Verifiers for Self-Taught Reasoners

Paper • 2402.06457 • Published Feb 9 • 9

Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 10

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Paper • 2406.12050 • Published Jun 17 • 19

upvoted a collection 13 days ago

Top LLM

Collection of TOP Open Source LLM, Sort by Best on top • 6 items • Updated Jul 26 • 13

upvoted a paper 27 days ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 28 days ago • 60

upvoted 7 papers about 1 month ago

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31 • 59

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Paper • 2410.02707 • Published Oct 3 • 47

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25 • 80

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 50

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22 • 11

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 37

upvoted a paper about 2 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

upvoted a paper 2 months ago

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20 • 13

upvoted 2 papers 3 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10 • 55

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5 • 30