M Veselovskiy's picture

M Veselovskiy

Yuuru

·

AI & ML interests

None yet

Recent Activity

New activity about 1 month ago

TheDrummer/UnslopSmall-22B-v1-GGUF:Metharme format makes model extremely stupid

View all activity

Organizations

Yuuru's activity

New activity in TheDrummer/UnslopSmall-22B-v1-GGUF about 1 month ago

Metharme format makes model extremely stupid

#1 opened about 1 month ago by

upvoted a collection 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 383

New activity in G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b 2 months ago

How to pay

#17 opened 2 months ago by

New activity in mattshumer/Reflection-Llama-3.1-70B 2 months ago

DLETE THIS MODEL

#76 opened 2 months ago by

Reacted to m-ric's post with 👍 3 months ago

Post

1912

🤯 𝗔 𝗻𝗲𝘄 𝟳𝟬𝗕 𝗼𝗽𝗲𝗻-𝘄𝗲𝗶𝗴𝗵𝘁𝘀 𝗟𝗟𝗠 𝗯𝗲𝗮𝘁𝘀 𝗖𝗹𝗮𝘂𝗱𝗲-𝟯.𝟱-𝗦𝗼𝗻𝗻𝗲𝘁 𝗮𝗻𝗱 𝗚𝗣𝗧-𝟰𝗼!

@mattshumer , CEO from Hyperwrite AI, had an idea he wanted to try out: why not fine-tune LLMs to always output their thoughts in specific parts, delineated by <thinking> tags?

Even better: inside of that, you could nest other sections, to reflect critically on previous output. Let’s name this part <reflection>. Planning is also put in a separate step.

He named the method “Reflection tuning” and set out to fine-tune a Llama-3.1-70B with it.

Well it turns out, it works mind-boggingly well!

🤯 Reflection-70B beats GPT-4o, Sonnet-3.5, and even the much bigger Llama-3.1-405B!

𝗧𝗟;𝗗𝗥
🥊 This new 70B open-weights model beats GPT-4o, Claude Sonnet, et al.
⏰ 405B in training, coming soon
📚 Report coming next week
⚙️ Uses GlaiveAI synthetic data
🤗 Available on HF!

I’m starting an Inference Endpoint right now for this model to give it a spin!

Check it out 👉 mattshumer/Reflection-Llama-3.1-70B

3 replies

·

New activity in yodayo-ai/kivotos-xl-2.0 6 months ago

Broken results

#1 opened 6 months ago by

liked a model 7 months ago

lllyasviel/ic-light

Updated May 8 • 169

liked a Space 7 months ago

Running on Zero

IC Light

New activity in saltlux/luxia-21.4b-alignment-v1.0 9 months ago

Quantized GGUF available

#3 opened 9 months ago by

liked a model 10 months ago

cagliostrolab/animagine-xl-3.0

Text-to-Image • Updated Jul 18 • 21.2k • 757

New activity in TheBloke/dolphin-2.6-mistral-7B-GPTQ 11 months ago

I am having issue loading dolphin 2.6 mistral 7B GPTQ:main by TheBloke . Pasting the error in the description pls help

#1 opened 11 months ago by

New activity in chargoddard/mixtralnt-4x7b-test 12 months ago

It works!!!

#1 opened 12 months ago by

New activity in TheBloke/Mixtral-8x7B-v0.1-GGUF 12 months ago

It works.

#3 opened 12 months ago by

New activity in mistralai/Mistral-7B-Instruct-v0.2 12 months ago

How is this different from v1?

#2 opened 12 months ago by

New activity in TheBlokeAI/Mixtral-tiny-GPTQ 12 months ago

What is this model?

#1 opened 12 months ago by

New activity in stabilityai/stable-video-diffusion-img2vid-xt 12 months ago

Did anyone figured it out how to run it in low vRAM like 15-25 GB

#21 opened 12 months ago by

upvoted a collection about 1 year ago

Recent models: last 100 repos, sorted by creation date

The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 506

New activity in TheBloke/Yi-34B-GPTQ about 1 year ago

How do i run it?

#2 opened about 1 year ago by

liked a Space over 1 year ago

Running on CPU Upgrade

Open LLM Leaderboard 2

Track, rank and evaluate open LLMs and chatbots

liked a model over 1 year ago

TheBloke/Wizard-Vicuna-30B-Uncensored-GGML

Updated Jun 7, 2023 • 122