Andrei A

inwaves

AI & ML interests

None yet

Recent Activity

updated a dataset 7 days ago

inwaves/magpie_ultra_length_tail

updated a dataset 9 days ago

inwaves/SkyworkReward-v0.2-PromptExtracted

Articles

Experimenting with different training objectives for an AI evaluator

inwaves's activity

upvoted a paper 17 days ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16 • 42

upvoted an article 17 days ago

Article

Experimenting with different training objectives for an AI evaluator

21 days ago • 2

upvoted a paper 28 days ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published 29 days ago • 13

upvoted 2 articles 2 months ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

Apr 24 • 59

Article

Let's talk about LLM evaluation

May 23 • 134

upvoted a paper 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted a paper 3 months ago

Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering

Paper • 2408.07888 • Published Aug 15 • 11