openaccess-ai-collective
/

wizard-mega-13b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

wizard-mega-13b / README.md

winglian's picture

update README and add config file

d0579e3 over 1 year ago

|

847 Bytes

	---
	datasets:
	- anon8231489123/ShareGPT_Vicuna_unfiltered
	- ehartford/wizard_vicuna_70k_unfiltered
	- ehartford/WizardLM_alpaca_evol_instruct_70k_unfiltered
	language:
	- en
	library_name: transformers
	pipeline_tag: text-generation
	---

	# Wizard Mega 13B - Pre-Release (Epoch One)

	Wizard Mega is a Llama 13B model fine-tuned on the ShareGPT, WizardLM, and Wizard-Vicuna datasets. These particular datasets have all been filtered to remove responses where the model responds with "As an AI language model...", etc or when the model refuses to respond.

	# Build

	Wizard Mega was built with [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) on 8xA100 80GB for 15 hours. The configuration to duplicate this build is provided in this repo's [/config folder](https://huggingface.co/openaccess-ai-collective/wizard-mega-13b/tree/main/configs).