Sugato Ray

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 17 hours ago

LLM Training Datasets

liked a dataset about 17 hours ago

megrisdal/llms-txt

updated a collection about 18 hours ago

sugatoray's activity

commented a paper 11 days ago

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1 • 28

commented 2 papers 2 months ago

Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models

Paper • 2406.04806 • Published Jun 7 • 1

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Paper • 2405.06682 • Published May 5 • 3

New activity in dvilasuero/img-prefs-distilabel 3 months ago

Update README.md with process-howto information

#2 opened 3 months ago by

commented 3 papers 4 months ago

Probabilistic Programming with Programmable Variational Inference

Paper • 2406.15742 • Published Jun 22 • 2

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Paper • 2406.16218 • Published Jun 23 • 1

TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

Paper • 2407.15734 • Published Jul 22 • 1

commented 3 papers 5 months ago

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

Paper • 2405.20233 • Published May 30 • 6

HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction

Paper • 2401.17948 • Published Jan 31 • 2

Extreme Compression of Large Language Models via Additive Quantization

Paper • 2401.06118 • Published Jan 11 • 12

New activity in sugatoray/DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M-GGUF 5 months ago

Add banner image to README.md

#2 opened 5 months ago by

Upload llama.png

#1 opened 5 months ago by

commented 2 papers 6 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 7

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20 • 34

commented 2 papers 7 months ago

Zero-Shot Tokenizer Transfer

Paper • 2405.07883 • Published May 13 • 5

Automating the Enterprise with Foundation Models

Paper • 2405.03710 • Published May 3 • 1

New activity in unalignment/toxic-dpo-v0.2 8 months ago

Update README.md

#2 opened 8 months ago by

New activity in HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 8 months ago

Update config.json

#11 opened 8 months ago by

New activity in mlx-community/stable-code-instruct-3b-4bit 8 months ago

Update config.json

#1 opened 8 months ago by

New activity in stabilityai/stable-code-instruct-3b 8 months ago

Update config.json with correct model-repo-name

#2 opened 8 months ago by