Sayhan Yalvaçer

sayhan

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
lmstudio-community/aya-expanse-32b-GGUF
liked a model about 1 month ago
CohereForAI/aya-expanse-32b
View all activity

Organizations

sayhan's activity

Reacted to DmitryRyumin's post with 🔥 7 months ago
view post
Post
2154
🔥🚀🌟 New Research Alert - xLSTM! 🌟🚀🔥
📄 Title: xLSTM: Extended Long Short-Term Memory 🔝

📝 Description: xLSTM is a scaled-up LSTM architecture with exponential gating and modified memory structures to mitigate known limitations. xLSTM blocks outperform SOTA transformers and state-space models in performance and scaling.

👥 Authors: Maximilian Beck et al.

📄 Paper: xLSTM: Extended Long Short-Term Memory (2405.04517)

📁 Repository: https://github.com/NX-AI/xlstm

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #xLSTM #DeepLearning #Innovation #AI
  • 1 reply
·