Firstname Lastname

takeraparterer

AI & ML interests

None yet

Recent Activity

Organizations

takeraparterer's activity

replied to TuringsSolutions's post about 10 hours ago
replied to TuringsSolutions's post about 11 hours ago
replied to TuringsSolutions's post about 11 hours ago
replied to TuringsSolutions's post about 11 hours ago
replied to TuringsSolutions's post about 13 hours ago
replied to TuringsSolutions's post about 13 hours ago
replied to TuringsSolutions's post about 14 hours ago
view reply

I think you should add dropout or decrease the size of the model

replied to TuringsSolutions's post about 14 hours ago
view reply

I think your model is overfitting you should add dropout or decrease the size of it

replied to TuringsSolutions's post about 14 hours ago
replied to TuringsSolutions's post about 15 hours ago
view reply

You should try dropout or decreasing the model size

replied to TuringsSolutions's post 1 day ago
view reply

What about test loss? It looks like overfitting to me.

replied to TuringsSolutions's post 1 day ago
replied to TuringsSolutions's post 6 days ago
replied to their post 7 days ago
replied to TuringsSolutions's post 7 days ago
view reply

image.png
that's a FFN, which is only a small part of an LLM

Reacted to TuringsSolutions's post with ๐Ÿ˜” 7 days ago
view post
Post
448
What if I told you that LLM models do not simply predict the next token in a sequence but instead utilize an emergent structural pattern-based system to comprehend language and concepts? I created a graph-based optimizer that not only works, but it also actually beats Adam, like very badly. I prove it thoroughly using SMOL LLM models. The secret? The graph is not what you think it is, humans. Code, full explanation, and more in this video. The Rhizome Optimizer is MIT licensed. I have completed my research. I fully understand now.

https://youtu.be/OMCRRueMhdI
  • 6 replies
ยท
replied to TuringsSolutions's post 16 days ago
view reply

has threatened violence in subtle ways

๐Ÿ˜ญ๐Ÿ˜ญ๐Ÿ˜ญ๐Ÿ˜ญ๐Ÿ˜ญ๐Ÿ˜ญ

replied to TuringsSolutions's post 16 days ago
view reply

very intriguing. looking into this ๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€

replied to TuringsSolutions's post 16 days ago
Reacted to TuringsSolutions's post with ๐Ÿ˜” 16 days ago
view post
Post
3944
Are you familiar with the difference between discrete learning and predictive learning? This distinction is exactly why LLM models are not designed to perform and execute function calls, they are not the right shape for it. LLM models are prediction machines. Function calling requires discrete learning machines. Fortunately, you can easily couple an LLM model with a discrete learning algorithm. It is beyond easy to do, you simply need to know the math to do it. Want to dive deeper into this subject? Check out this video.

https://youtu.be/wBRem2p8iPM
  • 8 replies
ยท