Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
fla-hub
/
gsa-2.7B-100B
like
0
Text Generation
Transformers
Safetensors
gsa
Inference Endpoints
arxiv:
2409.07146
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
Edit model card
YAML Metadata Warning:
empty or missing yaml metadata in repo card (
https://huggingface.co/docs/hub/model-cards#model-card-metadata
)
Model of the paper
Gated Slot Attention for Efficient Linear-Time Sequence Modeling
.
Downloads last month
44
Safetensors
Model size
2.73B params
Tensor type
BF16
·
Inference Examples
Text Generation
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to
Inference Endpoints (dedicated)
instead.
Collection including
fla-hub/gsa-2.7B-100B
GSA
Collection
3 items
•
Updated
18 days ago
•
2