arxiv:2409.13757
Ditto
dittops
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
Efficient Hybrid Inference for LLMs: Reward-Based Token Modelling with
Selective Cloud Assistance