Csaba Kecskemeti PRO

csabakecskemeti

AI & ML interests

None yet

Recent Activity

updated a model about 4 hours ago
DevQuasar/allenai.Llama-3.1-Tulu-3-8B-GGUF
updated a model about 5 hours ago
DevQuasar/AIDC-AI.Marco-o1-GGUF
View all activity

Organizations

csabakecskemeti's activity

Reacted to fdaudens's post with 😎 4 days ago
view post
Post
1544
🚀 @Qwen just dropped 2.5-Turbo!

1M token context (that's entire "War and Peace"!) + 4.3x faster processing speed. Same price, way more power 🔥

Check out the demo: Qwen/Qwen2.5-Turbo-1M-Demo

#QWEN
posted an update 5 days ago
view post
Post
1196
Some time ago, I built a predictive LLM router that routes chat requests between small and large LLM models based on prompt classification. It dynamically selects the most suitable model depending on the complexity of the user input, ensuring optimal performance while maintaining conversation context. I also fine-tuned a RoBERTa model to use with the package, but you can plug and play any classifier of your choice.

Project's homepage:
https://devquasar.com/llm-predictive-router/
Pypi:
https://pypi.org/project/llm-predictive-router/
Model:
DevQuasar/roberta-prompt_classifier-v0.1
Training data:
DevQuasar/llm_router_dataset-synth
Git:
https://github.com/csabakecskemeti/llm_predictive_router_package

Feel free to check it out, and/or contribute.