Cartinoe5930 (Hyunwoo Ko)

upvoted an article 27 days ago

Article

Navigating Korean LLM Research #2: Evaluation Tools

27 days ago

• 5

upvoted an article 28 days ago

Article

Navigating Korean LLM Research #1: Models

28 days ago

• 18

upvoted a paper about 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 134

upvoted an article 3 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Aug 19

• 73

upvoted 2 papers 3 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 115

EXAONE 3.0 7.8B Instruction Tuned Language Model

Paper • 2408.03541 • Published Aug 7 • 34

upvoted a paper 4 months ago

DDK: Distilling Domain Knowledge for Efficient Large Language Models

Paper • 2407.16154 • Published Jul 23 • 21

upvoted an article 4 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 215

upvoted a paper 4 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3 • 48

upvoted an article 4 months ago

Article

The Rise of Agentic Data Generation

Jul 15

• 76

upvoted an article 5 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23

• 55

upvoted a collection 5 months ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Collection

Outstanding research in interpretability and evaluation of language models, summarized • 82 items • Updated 1 day ago • 91

upvoted 3 papers 5 months ago

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Paper • 2406.12624 • Published Jun 18 • 36

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published Jun 17 • 30

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

upvoted an article 6 months ago

Article

Uncensor any LLM with abliteration

Jun 13

• 367

upvoted 2 papers 6 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31 • 63

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 67

upvoted an article 6 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7

• 39

upvoted a collection 7 months ago

Model Merging

Collection

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12 • 217
