LLaVA-Critic Collection • A general evaluator for assessing model performance • 6 items • Updated about 3 hours ago • 5
Eureka: Human-Level Reward Design via Coding Large Language Models Paper • 2310.12931 • Published Oct 19, 2023 • 26
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Paper • 2310.08588 • Published Oct 12, 2023 • 34