Reddit AITA Finetuning V1 Collection Datasets curated from the reddit r/amithea**hole subreddit and models finetuned on them using QLoRA. • 25 items • Updated Jun 8 • 1
Deep RL Agents Collection A collection of models trained using deep RL for a variety of games. • 11 items • Updated May 8 • 1
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 603