ZHANG Jipeng

OldFriends

Recent Activity

liked a dataset 13 days ago

cognitivecomputations/Code-290k-ShareGPT-Vicuna

liked a dataset 19 days ago

Sterzhang/PVIT-3M

upvoted a collection 22 days ago

MIT Talk 31/10 Papers

OldFriends's activity

upvoted a collection 22 days ago

MIT Talk 31/10 Papers

Collection

14 items • Updated 25 days ago • 29

upvoted a collection about 1 month ago

LLaVA-Critic

Collection

as a general evaluator for assessing model performance • 6 items • Updated Oct 6 • 8

upvoted a paper about 1 month ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9 • 69

upvoted an article 4 months ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16 • 265

upvoted a collection 4 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 63

upvoted an article 4 months ago

How NuminaMath Won the 1st AIMO Progress Prize

Article • Jul 11 • 104

upvoted a paper 4 months ago

TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts

Paper • 2407.03203 • Published Jul 3 • 10

upvoted an article 5 months ago

Large-scale Near-deduplication Behind BigCode

Article • May 16, 2023 • 18

upvoted a paper 5 months ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20 • 12

upvoted a paper 6 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 67

upvoted a paper 8 months ago

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26 • 16