Edmond Jacoupeau's picture

Edmond Jacoupeau

edmond

·

AI & ML interests

None yet

Recent Activity

liked a model 23 days ago

facebook/MobileLLM-1B

liked a model 23 days ago

HuggingFaceTB/SmolLM2-1.7B

upvoted a collection about 1 month ago

View all activity

Organizations

edmond's activity

upvoted a collection about 1 month ago

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14 • 12

upvoted an article 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 203

upvoted a paper 4 months ago

KAN or MLP: A Fairer Comparison

Paper • 2407.16674 • Published Jul 23 • 42

upvoted 2 collections 5 months ago

Gemma 2 Release

15 items • Updated Sep 9 • 197

Florence

9 items • Updated Jul 11 • 160

upvoted a paper 6 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23 • 37

upvoted an article 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 210

upvoted a collection 6 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 137

upvoted a collection 7 months ago

LLaVA++ (LLaMA-3 and Phi-3-Mini)

Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated Jun 11 • 23

upvoted 3 papers 7 months ago

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 103

Voyager: An Open-Ended Embodied Agent with Large Language Models

Paper • 2305.16291 • Published May 25, 2023 • 9

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Paper • 2206.08853 • Published Jun 17, 2022 • 1

upvoted a collection 7 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 683

upvoted a paper 7 months ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18 • 53

upvoted 4 papers 11 months ago

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels

Paper • 2312.17090 • Published Dec 28, 2023 • 4

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Paper • 2312.14385 • Published Dec 22, 2023 • 5

Pixel Aligned Language Models

Paper • 2312.09237 • Published Dec 14, 2023 • 14

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 37

upvoted 2 papers about 1 year ago

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 118

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 18