datasets: | |
- anon8231489123/ShareGPT_Vicuna_unfiltered | |
- ehartford/wizard_vicuna_70k_unfiltered | |
- ehartford/WizardLM_alpaca_evol_instruct_70k_unfiltered | |
language: | |
- en | |
library_name: transformers | |
pipeline_tag: text-generation | |
# Wizard Mega 13B - Pre-Release (Epoch One) | |
Wizard Mega is a Llama 13B model fine-tuned on the ShareGPT, WizardLM, and Wizard-Vicuna datasets. These particular datasets have all been filtered to remove responses where the model responds with "As an AI language model...", etc or when the model refuses to respond. | |
# Build | |
Wizard Mega was built with [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) on 8xA100 80GB for 15 hours. The configuration to duplicate this build is provided in this repo's [/config folder](https://huggingface.co/openaccess-ai-collective/wizard-mega-13b/tree/main/configs). | |