Undi95 committed
Commit f41a969
1 Parent(s): 4337f95

Update README.md

Files changed (1)
  1. README.md +45 -9
README.md CHANGED
@@ -23,15 +23,6 @@ As some people have told us our models are sloppy, Ikari decided to say fuck it
  Our dataset stayed the same since day one, we added data over time, cleaned them, and repeat. After not releasing model for a while because we were never satisfied, we think it's time to come back!


- ## Credits:
- - Undi
- - IkariDev
-
- ## Training data used:
- We will point out all dataset we used here, please be patient the time we get them all back kek.
-
- Temporary credit for the following madlads, who contributed to the datasets we have build over time: Gryphe, Caitlyn, Kalomaze, Gifted Gummy Bee, Sao [...]
-
  ## Prompt template: Llama-3-Instruct

  ```
@@ -44,6 +35,51 @@ Temporary credit for the following madlads, who contributed to the datasets we h
  {output}<|eot_id|>
  ```

+ ## Credits:
+ - Undi
+ - IkariDev
+
+ ## Training data we used to make our dataset:
+
+ - [Epiculous/Gnosis](https://huggingface.co/Epiculous/Gnosis)
+ - [ChaoticNeutrals/Luminous_Opus](https://huggingface.co/datasets/ChaoticNeutrals/Luminous_Opus)
+ - [ChaoticNeutrals/Synthetic-Dark-RP](https://huggingface.co/datasets/ChaoticNeutrals/Synthetic-Dark-RP)
+ - [ChaoticNeutrals/Synthetic-RP](https://huggingface.co/datasets/ChaoticNeutrals/Synthetic-RP)
+ - [Gryphe/Sonnet3.5-SlimOrcaDedupCleaned](https://huggingface.co/datasets/Gryphe/Sonnet3.5-SlimOrcaDedupCleaned)
+ - [Gryphe/Opus-WritingPrompts](https://huggingface.co/datasets/Gryphe/Opus-WritingPrompts)
+ - [meseca/writing-opus-6k](https://huggingface.co/datasets/meseca/writing-opus-6k)
+ - [meseca/opus-instruct-9k](https://huggingface.co/datasets/meseca/opus-instruct-9k)
+ - [PJMixers/grimulkan_theory-of-mind-ShareGPT](https://huggingface.co/datasets/PJMixers/grimulkan_theory-of-mind-ShareGPT)
+ - [NobodyExistsOnTheInternet/ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
+ - [Undi95/toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
+ - [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned)
+ - [kalomaze/Opus_Instruct_25k](https://huggingface.co/datasets/kalomaze/Opus_Instruct_25k)
+ - [Doctor-Shotgun/no-robots-sharegpt](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
+ - [Norquinal/claude_multiround_chat_30k](https://huggingface.co/datasets/Norquinal/claude_multiround_chat_30k)
+ - [nothingiisreal/Claude-3-Opus-Instruct-15K](https://huggingface.co/datasets/nothingiisreal/Claude-3-Opus-Instruct-15K)
+ - All the Aesirs dataset, cleaned, unslopped
+ - All le luminae dataset, cleaned, unslopped
+ - Small part of Airoboros reduced
+
+ We sadly didn't find the sources of the following, DM us if you recognize your set !
+
+ - Opus_Instruct-v2-6.5K-Filtered-v2-sharegpt
+ - claude_sharegpt_trimmed
+ - CapybaraPure_Decontaminated-ShareGPT_reduced
+
+ ## Datasets credits:
+ - Epiculous
+ - ChaoticNeutrals
+ - Gryphe
+ - meseca
+ - PJMixers
+ - NobodyExistsOnTheInternet
+ - cgato
+ - kalomaze
+ - Doctor-Shotgun
+ - Norquinal
+ - nothingiisreal
+
  ## Others

  Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).