Models for the paper Cottention: Linear Transformers With Cosine Attention https://arxiv.org/abs/2409.18747
Gabriel Mongaras
gmongaras
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 months ago
gmongaras/Softmax_Attention_BERT
updated
a model
about 2 months ago
gmongaras/Cosine_Attention_BERT
updated
a model
about 2 months ago
gmongaras/Cosine_Attention_GPT_1.2B
Organizations
Collections
5
Papers
1
models
18
gmongaras/Softmax_Attention_BERT
Feature Extraction
•
Updated
•
6
gmongaras/Cosine_Attention_BERT
Feature Extraction
•
Updated
•
6
gmongaras/Cosine_Attention_GPT_1.2B
Feature Extraction
•
Updated
•
2
gmongaras/Cosine_Attention_GPT_300M
Feature Extraction
•
Updated
•
2
gmongaras/Softmax_Attention_GPT_1.2B
Feature Extraction
•
Updated
•
2
gmongaras/Softmax_Attention_GPT_300M
Feature Extraction
•
Updated
•
2
gmongaras/Yann_UWU
Text Generation
•
Updated
•
10
gmongaras/Meta-Llama-3.1-8B
Text Generation
•
Updated
•
10
gmongaras/reddit_negative_v1_13B
Text Generation
•
Updated
•
8
•
1
gmongaras/Wizard_7B_Squad_v2
Text Generation
•
Updated
•
11
datasets
21
gmongaras/Elon_Tweets_Score
Viewer
•
Updated
•
5.9k
•
34
gmongaras/Elon_Tweets
Viewer
•
Updated
•
5.9k
•
43
gmongaras/Pile_Llama_Tokenized
Updated
•
8
gmongaras/Anime_Subtitle_data2
Viewer
•
Updated
•
1.91M
•
43
gmongaras/Anime_Subtitle_data
Viewer
•
Updated
•
14.6M
•
40
gmongaras/BERT_Base_Cased_128_Dataset_Mapped
Viewer
•
Updated
•
132M
•
822
gmongaras/BERT_Base_Cased_128_Dataset
Viewer
•
Updated
•
134M
•
275
gmongaras/Yann_LeCun_Tweets
Viewer
•
Updated
•
406
•
48
gmongaras/dummy_text_dataset
Viewer
•
Updated
•
2.05k
•
88
gmongaras/EleutherAI_the_pile_deduplicated
Viewer
•
Updated
•
134M
•
212
•
2