solidrust
/

Hermes-2-Pro-Llama-3-8B-AWQ

Model card Files Files and versions Community

Suparious commited on May 2

Commit

3172157

•

1 Parent(s): 4c501e5

Update README.md

Files changed (1) hide show

README.md +41 -0

README.md CHANGED Viewed

@@ -6,6 +6,33 @@ tags:
 - text-generation
 - autotrain_compatible
 - endpoints_compatible
 pipeline_tag: text-generation
 inference: false
 quantized_by: Suparious
@@ -15,7 +42,21 @@ quantized_by: Suparious
 - Model creator: [NousResearch](https://huggingface.co/NousResearch)
 - Original model: [Hermes-2-Pro-Llama-3-8B](https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B)
 ## How to use

 - text-generation
 - autotrain_compatible
 - endpoints_compatible
+- Llama-3
+- instruct
+- finetune
+- chatml
+- DPO
+- RLHF
+- gpt4
+- synthetic data
+- distillation
+- function calling
+- json mode
+- axolotl
+model-index:
+- name: Hermes-2-Pro-Llama-3-8B
+  results: []
+license: apache-2.0
+language:
+- en
+datasets:
+- teknium/OpenHermes-2.5
+widget:
+- example_title: Hermes 2 Pro
+  messages:
+  - role: system
+    content: You are a sentient, superintelligent artificial general intelligence, here to teach and assist me.
+  - role: user
+    content: Write a short story about Goku discovering kirby has teamed up with Majin Buu to destroy the world.
 pipeline_tag: text-generation
 inference: false
 quantized_by: Suparious
 - Model creator: [NousResearch](https://huggingface.co/NousResearch)
 - Original model: [Hermes-2-Pro-Llama-3-8B](https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/ggO2sBDJ8Bhc6w-zwTx5j.png)
+## Model Description
+Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house.
+This new version of Hermes maintains its excellent general task and conversation capabilities - but also excels at Function Calling, JSON Structured Outputs, and has improved on several other metrics as well, scoring a 90% on our function calling evaluation built in partnership with Fireworks.AI, and an 84% on our structured JSON Output evaluation.
+Hermes Pro takes advantage of a special system prompt and multi-turn function calling structure with a new chatml role in order to make function calling reliable and easy to parse. Learn more about prompting below.
+This version of Hermes 2 Pro adds several tokens to assist with agentic capabilities in parsing while streaming tokens - `<tools>`, `<tool_call>`, `<tool_response>` and their closing tags are single tokens now.
+This work was a collaboration between Nous Research, @interstellarninja, and Fireworks.AI
+Learn more about the function calling system for this model on our github repo here: https://github.com/NousResearch/Hermes-Function-Calling
 ## How to use