MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 49
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 138
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper • 2401.04081 • Published • 70
hustvl/Vim-tiny
Updated • 19
Michael Schock
mjschock
AI & ML interests: None yet
Organizations: None yet
Collections: 1
Spaces: 1
Models: 33
mjschock/TinyLlama-1.1B-Chat-v1.0-qlora-ultrachat • Updated
mjschock/TinyLlama-1.1B-2.5T-chat-and-function-calling-Q4_K_M-GGUF • Text Generation • Updated • 8
mjschock/TinyLlama-1.1B-Chat-v1.0-Q8_0-GGUF • Updated • 18 • 1
mjschock/SmolLM-135M-Q4_K_M-GGUF • Updated • 11 • 1
mjschock/open_llama_3b_v2-Q8_0-GGUF • Updated • 7 • 1
mjschock/TinySolar-248m-4k-py-Q4_K_M-GGUF • Updated • 3 • 1
mjschock/TinySolar-248m-4k-Q4_K_M-GGUF • Updated • 2 • 1
mjschock/TinySolar-248m-4k-code-instruct-Q4_K_M-GGUF • Updated • 10 • 2
mjschock/TinyLlama_v1.1_math_code-Q4_K_M-GGUF • Updated • 9 • 3
mjschock/TinyLlama-1.1B-Chat-v1.0-Q4_K_M-GGUF • Updated • 255 • 1