Text Generation
Transformers
PyTorch
English
llama
text-generation-inference
Inference Endpoints
wizard-mega-13b / README.md
winglian's picture
update README and add config file
d0579e3
|
raw
history blame
847 Bytes
metadata
datasets:
  - anon8231489123/ShareGPT_Vicuna_unfiltered
  - ehartford/wizard_vicuna_70k_unfiltered
  - ehartford/WizardLM_alpaca_evol_instruct_70k_unfiltered
language:
  - en
library_name: transformers
pipeline_tag: text-generation

Wizard Mega 13B - Pre-Release (Epoch One)

Wizard Mega is a Llama 13B model fine-tuned on the ShareGPT, WizardLM, and Wizard-Vicuna datasets. These particular datasets have all been filtered to remove responses where the model responds with "As an AI language model...", etc or when the model refuses to respond.

Build

Wizard Mega was built with Axolotl on 8xA100 80GB for 15 hours. The configuration to duplicate this build is provided in this repo's /config folder.