This model is a merge of pre-trained language models, created with the DARE method using mergekit.
A more detailed description of the model will be added soon.
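For readers unfamiliar with DARE, the core idea is "drop and rescale": take the difference (delta) between a fine-tuned model and its base, randomly zero out a fraction of the delta entries, rescale the survivors by 1 / (1 - drop rate), and add the result back to the base. The following is a minimal pure-Python sketch of that idea on toy weight lists, not the recipe used for this model. The function names (`dare_delta`, `dare_merge`), the drop rate of 0.5, the toy weights, and the choice to simply sum the rescaled deltas are all assumptions made for illustration; this card does not state the source models, the drop rate, or which DARE variant was used in mergekit.

```python
import random

def dare_delta(base, finetuned, drop_rate, rng):
    """Drop-and-rescale the delta (finetuned - base) for one flat weight list."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    keep_scale = 1.0 / (1.0 - drop_rate)
    sparse_delta = []
    for b, f in zip(base, finetuned):
        delta = f - b
        # Drop each delta entry with probability drop_rate; rescale the survivors
        # so the expected value of the delta is unchanged.
        if rng.random() < drop_rate:
            sparse_delta.append(0.0)
        else:
            sparse_delta.append(delta * keep_scale)
    return sparse_delta

def dare_merge(base, finetuned_models, drop_rate=0.5, seed=0):
    """Add the sparsified deltas of several fine-tuned models onto the base.

    Summing the deltas is an illustrative assumption; other combination
    rules can be applied after the drop-and-rescale step.
    """
    rng = random.Random(seed)
    merged = list(base)
    for finetuned in finetuned_models:
        for i, d in enumerate(dare_delta(base, finetuned, drop_rate, rng)):
            merged[i] += d
    return merged

# Hypothetical toy weights, standing in for real model tensors.
base = [0.0, 1.0, 2.0, 3.0]
model_a = [0.1, 1.0, 2.2, 3.0]
model_b = [0.0, 0.8, 2.0, 3.4]

merged = dare_merge(base, [model_a, model_b], drop_rate=0.5)
print(merged)
```

With `drop_rate=0.0` nothing is dropped and the sketch reduces to adding every delta to the base unchanged, which is a convenient sanity check.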
Loading the Model
Use the following Python code to load the model:
import torch
from transformers import MistralForCausalLM, AutoTokenizer
# device_map="auto" places the weights on the available devices (requires the accelerate package)
model = MistralForCausalLM.from_pretrained("vanillaOVO/supermario_v2", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("vanillaOVO/supermario_v2")
Generating Text
To generate text, use the following Python code:
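text = "Large language models are "
# Move the tokenized inputs to the model's device, since device_map="auto" may place the model on a GPU
inputs = tokenizer(text, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))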