9 9 3

Sachith Gunasekara

sachithgunasekara

AI & ML interests

Large Language Models (LLMs), Deep Learning, AI Safety/Privacy

Recent Activity

updated a dataset 21 days ago

sachithgunasekara/self-discover-mistral-modified-MATH-eval-annotated

updated a dataset 22 days ago

sachithgunasekara/self-discover-mistral-modified-MATH-eval

updated a dataset 22 days ago

sachithgunasekara/self-discover-mistral-modified-bbh-eval

View all activity

Articles

nanoJAXGPT: A pedagogical introduction to JAX/Equinox

Oct 23

• 2

Organizations

sachithgunasekara's activity

upvoted an article about 1 month ago

Article

nanoJAXGPT: A pedagogical introduction to JAX/Equinox

•

Oct 23

• 2

upvoted 3 articles 2 months ago

Article

Introducing the SQL Console on Datasets

Sep 17

• 20

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 171

Article

🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT

•

Sep 24

• 12

upvoted 2 papers 2 months ago

How FaR Are Large Language Models From Agents with Theory-of-Mind?

Paper • 2310.03051 • Published Oct 4, 2023 • 34

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted 2 articles 6 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 201

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 97

upvoted a collection 8 months ago

Open-Bezoar

Collection

Small, Cost-Effective and Open Models Trained on Mixes of Instruction Data • 7 items • Updated Apr 19 • 6