You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Ultron_storm_sft_20231210

Ultron is a series of LLMs ranging from 160M to 1.1B parameters.

Details of Ultron_storm_sft_20231210

Parameters: 1.1B parameters

Attention: Grouped Query Attention

Sequence Length: 2048 tokens

Learning rate: 4e-4

Dataset Size: 950B tokens

Note: This model is just a placeholder and doesn't represent the final Ultron lineup.

Downloads last month
0
Safetensors
Model size
1.1B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Datasets used to train cretone/ultron_storm_sft_20231210

Collection including cretone/ultron_storm_sft_20231210