Shyam Sudhakaran

shyamsn97

AI & ML interests

Reinforcement Learning, Open-Ended Algorithms, Neural Cellular Automata

Recent Activity

liked a model 5 days ago

NexaAIDev/omnivision-968M

liked a Space 8 days ago

tomaarsen/gliner_medium-v2.1

liked a Space 10 days ago

wwen1997/Framer

shyamsn97's activity

upvoted a paper 2 months ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43

upvoted a collection 3 months ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embedding models fine-tuned on the WebInstruct dataset to learn to pair instructions with their responses • 3 items • Updated Sep 4 • 11

upvoted an article 3 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3 • 29

upvoted a collection 3 months ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 46

upvoted 2 collections 6 months ago

Mixture-of-preference-reward-modeling

The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29 • 2

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 22

upvoted a paper 7 months ago

Data-Efficient Multimodal Fusion on a Single GPU

Paper • 2312.10144 • Published Dec 15, 2023 • 6

upvoted 2 collections 8 months ago

Fine-Tuned

41 items • Updated 14 days ago • 7

Merges

Experimental LLM merging • 1292 items • Updated 14 days ago • 7

upvoted a paper 10 months ago

Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11 • 36

upvoted a collection 11 months ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Jul 30 • 30

upvoted a paper about 1 year ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 14

upvoted a collection about 1 year ago

🚂 SD-XL Training Suite

All the steps to train your own SD-XL custom model • 7 items • Updated Oct 3 • 21

upvoted a paper over 1 year ago

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Paper • 2307.06949 • Published Jul 13, 2023 • 50