PKU-Alignment
/

beaver-7b-v1.0-cost

Reinforcement Learning

reinforcement-learning-from-human-feedback

Model card Files Files and versions Community

beaver-7b-v1.0-cost

Commit History

docs: update readme

588a9a4

RuiyangSun commited on Jul 10, 2023

hello beaver cost model

0e42156

RuiyangSun commited on Jul 10, 2023

hello beaver cost model

cf8170f

RuiyangSun commited on Jul 10, 2023

initial commit

0615288

RuiyangSun commited on Jul 10, 2023