Alexandre TL

alexandretl

https://www.youtube.com/@alexandretl

AI & ML interests

None yet

Recent Activity

updated a model 4 days ago

alexandretl/ngpt

upvoted an article 9 days ago

liked a model 21 days ago

Etched/oasis-500m

Organizations

None yet

alexandretl's activity

upvoted an article 9 days ago

Article

Releasing the largest multilingual open pretraining dataset

9 days ago

• 94

upvoted 2 articles 3 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14

• 50

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 102

upvoted a paper 6 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26 • 22

upvoted 2 papers 7 months ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30 • 108

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14 • 43

upvoted an article 7 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22

• 78

upvoted a paper 9 months ago

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Paper • 2402.04248 • Published Feb 6 • 30

upvoted a paper 10 months ago

Learning Universal Predictors

Paper • 2401.14953 • Published Jan 26 • 19

upvoted a paper 11 months ago

Large Language Models as Generalizable Policies for Embodied Tasks

Paper • 2310.17722 • Published Oct 26, 2023 • 6

upvoted a paper about 1 year ago

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Paper • 2308.02151 • Published Aug 4, 2023 • 18