To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning (arXiv:2409.12183, published Sep 18, 2024)
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems (arXiv:2402.12875, published Feb 20, 2024)
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices (arXiv:2410.00531, published Oct 1, 2024)
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs (arXiv:2410.12405, published Oct 2024)
What Happened in LLMs' Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective (arXiv:2410.23743, published Oct 2024)
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments (arXiv:2410.23918, published Oct 2024)
ATM: Improving Model Merging by Alternating Tuning and Merging (arXiv:2411.03055, published Nov 2024)
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models (arXiv:2411.04905, published Nov 2024)
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model (arXiv:2411.04496, published Nov 2024)