Hieu Ngo

hiieu

AI & ML interests

Applied, Post-Training LLM

Recent Activity

liked a dataset 4 days ago

mlabonne/orca-agentinstruct-1M-v1-cleaned

liked a dataset 4 days ago

microsoft/orca-agentinstruct-1M-v1

hiieu's activity

upvoted a paper about 19 hours ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published 3 days ago • 36

upvoted a paper 18 days ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published 21 days ago • 20

upvoted a paper 23 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 27 days ago • 79

upvoted a paper 25 days ago

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21 • 19

upvoted 2 articles about 1 month ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Oct 20 • 31

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Oct 14 • 55

upvoted a paper 2 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9 • 45

upvoted a collection 3 months ago

Gemma 2 ChatQA RAG finetuned

1 item • Updated Sep 2 • 1

upvoted an article 3 months ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Aug 21 • 22

upvoted a paper 3 months ago

Synthesizing Text-to-SQL Data from Weak and Strong LLMs

Paper • 2408.03256 • Published Aug 6 • 10

upvoted a paper 4 months ago

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1 • 22

upvoted a collection 4 months ago

ShieldGemma Release

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 31 • 11

upvoted a paper 4 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 38

upvoted a collection 4 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

upvoted a paper 4 months ago

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Paper • 2407.14482 • Published Jul 19 • 25

upvoted a collection 4 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 63

upvoted 2 articles 4 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16 • 32

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16 • 265

upvoted a collection 4 months ago

H2O Danube3

6 items • Updated Oct 17 • 53

upvoted an article 4 months ago

Article

The Rise of Agentic Data Generation

Jul 15 • 78