PKU-Alignment
/

beaver-7b-v1.0-cost

Reinforcement Learning

reinforcement-learning-from-human-feedback

Model card Files Files and versions Community

beaver-7b-v1.0-cost

Commit History

Update README.md

c2f25b2

RuiyangSun commited on Jul 12, 2023

docs: update readme

32e35c1

RuiyangSun commited on Jul 10, 2023

docs: update readme

588a9a4

RuiyangSun commited on Jul 10, 2023

hello beaver cost model

0e42156

RuiyangSun commited on Jul 10, 2023

hello beaver cost model

cf8170f

RuiyangSun commited on Jul 10, 2023

initial commit

0615288

RuiyangSun commited on Jul 10, 2023