19 353

Huu-Hiep Nguyen

hiepnh

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

nvidia/Hymba-1.5B-Base

liked a model 2 days ago

briaai/RMBG-2.0

liked a model 6 days ago

NexaAIDev/omnivision-968M

Organizations

None yet

hiepnh's activity

upvoted a collection about 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 30 days ago • 488

upvoted a collection 2 months ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 78

upvoted a collection 3 months ago

Yi-Coder

4 items • Updated Sep 4 • 30

upvoted a collection 5 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5 • 83

upvoted a paper 6 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 150

upvoted 2 articles 6 months ago

Open-source LLMs as LangChain Agents

Article • Jan 24 • 36

License to Call: Introducing Transformers Agents 2.0

Article • May 13 • 118

upvoted a paper 7 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 254

upvoted a paper 9 months ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8 • 60

upvoted 8 papers about 1 year ago

AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning

Paper • 2311.00257 • Published Nov 1, 2023 • 8

FlashDecoding++: Faster Large Language Model Inference on GPUs

Paper • 2311.01282 • Published Nov 2, 2023 • 35

Sparse Finetuning for Inference Acceleration of Large Language Models

Paper • 2310.06927 • Published Oct 10, 2023 • 14

Finite Scalar Quantization: VQ-VAE Made Simple

Paper • 2309.15505 • Published Sep 27, 2023 • 21

LMDX: Language Model-based Document Information Extraction and Localization

Paper • 2309.10952 • Published Sep 19, 2023 • 65

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 32

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 25

Large Language Model for Science: A Study on P vs. NP

Paper • 2309.05689 • Published Sep 11, 2023 • 20

upvoted 2 papers over 1 year ago

OctoPack: Instruction Tuning Code Large Language Models

Paper • 2308.07124 • Published Aug 14, 2023 • 28

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 25