Hao Sun

Holarissun

https://holarissun.github.io/

AI & ML interests

[email protected]. Deep RL, RL x LLM, RLHF.

Organizations

None yet

Holarissun's activity

upvoted a paper 10 months ago

Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Paper • 2310.07747 • Published Oct 11, 2023 • 1

upvoted 4 papers about 1 year ago

What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization

Paper • 2207.05161 • Published Jul 11, 2022 • 1

Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond

Paper • 2310.06147 • Published Oct 9, 2023 • 1

Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping

Paper • 2209.07288 • Published Sep 15, 2022 • 1

Offline Prompt Evaluation and Optimization with Inverse Reinforcement Learning

Paper • 2309.06553 • Published Sep 13, 2023 • 4