1 5 12

Gusti Triandi Winata

sanggusti

https://sanggusti.tech

AI & ML interests

MLSys, RL/SSL, Applied ML

Recent Activity

authored a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

commented a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

upvoted a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

View all activity

Organizations

sanggusti's activity

authored a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20 • 10

commented a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20 • 10 •

upvoted a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20 • 10

upvoted a collection about 1 month ago

Multilingual RewardBench

Collection

Multilingual Reward Model Evaluation Dataset and Results • 2 items • Updated Oct 26 • 4

liked a Space about 2 months ago

Sleeping

👀

Let Me Label You

upvoted an article 3 months ago

Article

Scaling robotics datasets with video encoding

Aug 27

• 34

liked a Space 5 months ago

Running on Zero

1.5k

📺

Stable Video Diffusion 1.1

liked 3 models 5 months ago

upvoted 2 articles 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 102

liked a model 5 months ago

CohereForAI/aya-101

Text2Text Generation • Updated Mar 31 • 2.95k • 617

liked 4 Spaces 6 months ago

Running on Zero

649

⚡

Unique3D

Create a 1M faces 3D colored model from an image!

Running

143

🤖

NPC Playground

Paused

🔥

Llava Next

Runtime error

🩻

CheXRay

A vision model that reads Chest X-Rays

liked a model 7 months ago

zhiqings/LLaVA-RLHF-13b-v1.5-336

Updated Nov 1, 2023 • 19

updated a model 8 months ago

sanggusti/lunar_lander

Reinforcement Learning • Updated Apr 15 • 1

liked a Space 10 months ago

Running on A10G

205

🔥

Gusti Triandi Winata

AI & ML interests

Recent Activity

Organizations

sanggusti's activity

Let Me Label You

Scaling robotics datasets with video encoding

Stable Video Diffusion 1.1

Putting RL back in RLHF

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Unique3D

NPC Playground

Llava Next

CheXRay

Controlnet for Interior Design