Robert Sinclair

ZeroWw

AI & ML interests

LLM optimization (model quantization and back-end optimizations), so that LLMs can run on the computers of people who still have both kidneys. Discord: https://discord.com/channels/@robert_46007

ZeroWw's activity

replied to TuringsSolutions's post 27 days ago

hence my idea of the SILLY versions... ;)

replied to TuringsSolutions's post 27 days ago

I am pretty sure that the current models, "as they are", could perform 10 times better using chain of thought and algorithms like these, without needing different training. And I think that's probably what Claude does.

Reacted to TuringsSolutions's post with ❤️ 27 days ago
Transformers are not all we need; that is being proven repeatedly now as more alternative frameworks emerge. Another such framework is Kolmogorov-Arnold Network (KAN) based Transformers. I break down exactly how these differ from perceptron-based Transformers and give you the link to my Colab where I create a model based on the research paper that absolutely destroys a standard Transformer-based model. Check out the video here: https://www.youtube.com/watch?v=Sw0euxNZCc4
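
The structural difference the post describes can be sketched in a few lines: a perceptron layer applies a fixed nonlinearity after a learned linear map, while a KAN-style layer learns a univariate function on every edge. The sketch below is illustrative only, not the code from the Colab; it parameterizes the edge functions with Gaussian radial basis functions rather than the B-splines used in the KAN paper, and all names and shapes are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_layer(x, W, b):
    # Perceptron-style layer: learned linear map, fixed nonlinearity.
    return np.tanh(W @ x + b)

def kan_layer(x, coeffs, centers, width=1.0):
    # KAN-style layer: a learnable univariate function on every edge.
    # Each edge function is a sum of Gaussian radial basis functions here
    # (the KAN paper uses B-splines; RBFs keep this sketch short).
    # coeffs: (n_out, n_in, n_basis), centers: (n_basis,)
    basis = np.exp(-((x[None, :, None] - centers) / width) ** 2)  # (1, n_in, n_basis)
    edge_values = (coeffs * basis).sum(axis=-1)                    # (n_out, n_in)
    return edge_values.sum(axis=-1)                                # (n_out,)

x = rng.normal(size=4)
W, b = rng.normal(size=(3, 4)), rng.normal(size=3)
coeffs = rng.normal(size=(3, 4, 5))
centers = np.linspace(-2.0, 2.0, 5)

print(mlp_layer(x, W, b).shape, kan_layer(x, coeffs, centers).shape)  # (3,) (3,)
```

The learnable edge functions are what give KANs their extra expressiveness per parameter; everything else (stacking layers, residuals, attention) composes the same way as in a standard Transformer.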
Reacted to TuringsSolutions's post with ❤️ 27 days ago
I think Reinforcement Learning is the future, for a lot of reasons. I spell them out for you in this video, and also provide you with the basic code to get up and running with Atari and OpenAI Gym. If you want to get into RL, this is your ticket. Link to a cool training montage of the model in the description of the video as well. Step 2 from here would be the full-on training and certification that HuggingFace offers for RL.

https://youtu.be/ueZl3A36ZQk
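
The core RL loop the post points to can be shown without any Gym dependency. This is a self-contained toy, not the code from the video: tabular Q-learning on a 5-cell corridor stands in for the Atari setup, and all hyperparameters are illustrative choices.

```python
import numpy as np

# Tabular Q-learning on a 5-cell corridor: start at cell 0, reward at cell 4.
# Actions: 0 = move left, 1 = move right.
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.5, 0.9, 0.1   # learning rate, discount, exploration
rng = np.random.default_rng(0)

for episode in range(200):
    s = 0
    while s != 4:
        # Epsilon-greedy action selection.
        a = rng.integers(n_actions) if rng.random() < eps else int(Q[s].argmax())
        s2 = max(0, s - 1) if a == 0 else min(4, s + 1)
        r = 1.0 if s2 == 4 else 0.0
        # Q-learning update: bootstrap from the best next-state value.
        Q[s, a] += alpha * (r + gamma * Q[s2].max() - Q[s, a])
        s = s2

# The greedy policy should now move right in every non-terminal state.
print(Q.argmax(axis=1)[:4])  # [1 1 1 1]
```

Swapping the corridor for `gymnasium.make("ALE/Breakout-v5")` and the table for a neural network is essentially the path from this toy to the Atari agents shown in the video.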
New activity in TuringsSolutions/Phi3Unlocked 27 days ago

My quants and silly experiment.

#1 opened 28 days ago by ZeroWw
New activity in CohereForAI/aya-expanse-8b 28 days ago
Reacted to TuringsSolutions's post with 👍 29 days ago
Ever wondered how neural networks actually work under the hood?

In my latest video, I break down the core mathematical concepts behind neural networks in a way that's easy for IT professionals to understand. We'll explore:

- Neurons as logic gates
- Weighted sums and activation functions
- Gradient descent and backpropagation

No complex equations or jargon, just clear explanations and helpful visuals!

āž”ļø Watch now and unlock the mysteries of neural networks: https://youtu.be/L5_I1ZHoGnM