DeepDream2045
/

neuraldaredevil-exl2-6bpw

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

neuraldaredevil-exl2-6bpw / README.md

DeepDream2045's picture

Initial commit

a658ab1 4 months ago

|

history blame contribute delete

3.39 kB

	### exl2 quant (measurement.json included)
	---
	### original readme below
	---
	---
	license: cc-by-nc-4.0
	base_model: mlabonne/Daredevil-7B
	tags:
	- merge
	- mergekit
	- lazymergekit
	- dpo
	- rlhf
	- mlabonne/example
	---

	![](https://i.imgur.com/D80Ua7T.png)

	# NeuralDaredevil-7B

	NeuralDaredevil-7B is a DPO fine-tune of [mlabonne/Daredevil-7B](https://huggingface.co/mlabonne/Daredevil-7B) using the [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs) preference dataset and my DPO notebook from [this article](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac).

	Thanks [Argilla](https://huggingface.co/argilla) for providing the dataset and the training recipe [here](https://huggingface.co/argilla/distilabeled-Marcoro14-7B-slerp). 💪

	## 🏆 Evaluation

	The evaluation was performed using [LLM AutoEval](https://github.com/mlabonne/llm-autoeval) on Nous suite.

	\| Model \| Average \| AGIEval \| GPT4All \| TruthfulQA \| Bigbench \|
	\|---\|---:\|---:\|---:\|---:\|---:\|
	\| [mlabonne/NeuralDaredevil-7B](https://huggingface.co/mlabonne/NeuralDaredevil-7B) [📄](https://gist.github.com/mlabonne/cbeb077d1df71cb81c78f742f19f4155) \| 59.39 \| 45.23 \| 76.2 \| 67.61 \| 48.52 \|
	\| [mlabonne/Beagle14-7B](https://huggingface.co/mlabonne/Beagle14-7B) [📄](https://gist.github.com/mlabonne/f5a5bf8c0827bbec2f05b97cc62d642c) \| 59.4 \| 44.38 \| 76.53 \| 69.44 \| 47.25 \|
	\| [argilla/distilabeled-Marcoro14-7B-slerp](https://huggingface.co/argilla/distilabeled-Marcoro14-7B-slerp) [📄](https://gist.github.com/mlabonne/9082c4e59f4d3f3543c5eda3f4807040) \| 58.93 \| 45.38 \| 76.48 \| 65.68 \| 48.18 \|
	\| [mlabonne/NeuralMarcoro14-7B](https://huggingface.co/mlabonne/NeuralMarcoro14-7B) [📄](https://gist.github.com/mlabonne/b31572a4711c945a4827e7242cfc4b9d) \| 58.4 \| 44.59 \| 76.17 \| 65.94 \| 46.9 \|
	\| [openchat/openchat-3.5-0106](https://huggingface.co/openchat/openchat-3.5-0106) [📄](https://gist.github.com/mlabonne/1afab87b543b0717ec08722cf086dcc3) \| 53.71 \| 44.17 \| 73.72 \| 52.53 \| 44.4 \|
	\| [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B) [📄](https://gist.github.com/mlabonne/88b21dd9698ffed75d6163ebdc2f6cc8) \| 52.42 \| 42.75 \| 72.99 \| 52.99 \| 40.94 \|

	You can find the complete benchmark on [YALL - Yet Another LLM Leaderboard](https://huggingface.co/spaces/mlabonne/Yet_Another_LLM_Leaderboard).

	## 💻 Usage

	```python
	!pip install -qU transformers accelerate

	from transformers import AutoTokenizer
	import transformers
	import torch

	model = "mlabonne/NeuralDaredevil-7B"
	messages = [{"role": "user", "content": "What is a large language model?"}]

	tokenizer = AutoTokenizer.from_pretrained(model)
	prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
	pipeline = transformers.pipeline(
	"text-generation",
	model=model,
	torch_dtype=torch.float16,
	device_map="auto",
	)

	outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
	print(outputs[0]["generated_text"])
	```

	<p align="center">
	<a href="https://github.com/argilla-io/distilabel">
	<img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/>
	</a>
	</p>