Junjie Chen

coderchen01

https://junjie-chen.info

AI & ML interests

Efficient AI, Multimodal AI, Generative AI

Organizations

None yet

coderchen01's activity

upvoted a paper 2 days ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published 6 days ago • 47

upvoted an article 3 days ago

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

Article • May 21 • 34

upvoted a paper 5 days ago

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published 11 days ago • 12

upvoted a paper 6 days ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published 14 days ago • 16

upvoted a paper 18 days ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 21 days ago • 44

upvoted an article about 1 month ago

🕳️ Attention Sinks in LLMs for endless fluency

Article • Oct 9, 2023 • 8

upvoted a paper about 1 month ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

upvoted 2 articles about 1 month ago

Scaling AI-based Data Processing with Hugging Face + Dask

Article • Oct 9 • 23

How 🤗 Accelerate runs very large models thanks to PyTorch

Article • Sep 27, 2022 • 10

upvoted 2 papers about 2 months ago

MLP-KAN: Unifying Deep Representation and Function Learning

Paper • 2410.03027 • Published Oct 3 • 28

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2 • 25

upvoted 2 papers 2 months ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20 • 48

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10 • 55

upvoted 3 papers 4 months ago

upvoted a collection 5 months ago

Model Merging

Collection • 30 items • Updated Jun 12 • 217

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it!

upvoted 3 papers 5 months ago

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4 • 35

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2 • 49

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Paper • 2407.00468 • Published Jun 29 • 34