A Controlled Study on Long Context Extension and Generalization in LLMs • Paper • 2409.12181 • Published Sep 18 • 43 upvotes
Gated Slot Attention for Efficient Linear-Time Sequence Modeling • Paper • 2409.07146 • Published Sep 11 • 19 upvotes
Parallelizing Linear Transformers with the Delta Rule over Sequence Length • Paper • 2406.06484 • Published Jun 10 • 3 upvotes
based • Collection • 15 items • Updated Oct 18 • 9 upvotes
These language model checkpoints are trained at the 360M and 1.3B parameter scales for up to 50B tokens on the Pile corpus, for research purposes.