Prototype Models
Collection
Models developed for testing purposes
•
3 items
•
Updated
Ultron is a series of LLMs ranging from 160M to 1.1B parameters.
Parameters: 1.1B parameters
Attention: Grouped Query Attention
Sequence Length: 2048 tokens
Learning rate: 4e-4
Dataset Size: 950B tokens
Note: This model is just a placeholder and doesn't represent the final Ultron lineup.