Zoltan Csaki

zolicsaki

AI & ML interests

None yet

Recent Activity

liked a Space 26 days ago
sambanovasystems/Pictionary
liked a Space about 2 months ago
KingNish/Live-Video-Chat
liked a model about 2 months ago
sbintuitions/sarashina2-70b
View all activity

Organizations

zolicsaki's activity

updated a Space about 2 months ago
liked a Space 2 months ago
Reacted to kz919's post with πŸš€ 2 months ago
view post
Post
1257
Just for the meme.

But the clear lesson I learnt from building these demos are, the more powerful the underlying base model is, the closer you will get to GPT4o1. CoT is nothing more than simply inducing the latent reasoning capability from the model.

kz919/GPT4-O1-Proximas
Reacted to KingNish's post with πŸ‘ 2 months ago
posted an update 2 months ago
view post
Post
1247
We’ve open-sourced an app, powered by SambaNova Cloud and Llama 405B, that intelligently detects when a web search is neededβ€”then answers directly or with RAG.

sambanovasystems/auto-web-search

πŸ₯š A hidden Easter egg is that Auto Search detection is already trained into Llama 3.1 checkpoints. Simply use the tool usage system prompt below, and the model will either respond with a web search query if it deems necessary or respond to the query directly.πŸ₯š

Environment: IPython
Tools: Brave Search
Knowledge Cutoff Date: December 2023
Today's Date: September 2024
You are a helpful assistant. Reminder:
Search function calls MUST follow the specified format: "brave_search.call(query)"

You can see the documentation here
https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_1#built-in-tooling
and read about how the tool usage was trained into Llama3.1 models in section 4.3.5 here https://arxiv.org/pdf/2407.21783
posted an update 2 months ago
view post
Post
1287
Fast inference is no longer a nice-to-have demo; it will be the driving force behind future frontier models. Time to switch over to custom AI hardware and short Nvidia.

Try out SambaNova's lightning fast API for free at https://sambanova.ai/fast-api?api_ref=444868
liked a Space 2 months ago
Reacted to kz919's post with πŸ§ πŸ€―πŸ€—πŸ”₯πŸš€πŸ˜Ž 3 months ago
Reacted to their post with πŸ€— 3 months ago
view post
Post
1809
You can run Llama405B at over 100 tokens per second for free using SambaNova's API! https://sambanova.ai/fast-api?api_ref=444868

I have been able to generate some high quality synthetic data and use it as an LLM as a judge instead of the slower and more expensive alternatives like openAI or Anthropic.

  • 2 replies
Β·
replied to their post 3 months ago
view reply

@gghfez all you need is a valid email, I think they send out the API keys once a day when they approve you. They approve everyone unless they think its a spam trying to get more then one key