lmzheng commited on
Commit
6ce2caf
1 Parent(s): 7976eed

Update model card

Browse files
Files changed (1) hide show
  1. README.md +35 -3
README.md CHANGED
@@ -3,6 +3,38 @@ license: apache-2.0
3
  inference: false
4
  ---
5
 
6
- **NOTE: This "delta model" cannot be used directly.**
7
- Users have to apply it on top of the original LLaMA weights.
8
- See https://github.com/lm-sys/FastChat#vicuna-weights for instructions.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  inference: false
4
  ---
5
 
6
+ **NOTE: This "delta model" cannot be used directly.**
7
+ Users have to apply it on top of the original LLaMA weights to get actual Vicuna weights.
8
+ See https://github.com/lm-sys/FastChat#vicuna-weights for instructions.
9
+
10
+ # Vicuna Model Card
11
+
12
+ ## Model details
13
+
14
+ **Model type**
15
+ Vicuna-13B is an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT.
16
+
17
+ **Organizations developing the model**
18
+ The Vicuna team with members from UC Berkeley, CMU, Stanford, and UC San Diego.
19
+
20
+ **Paper or resources for more information**
21
+ https://vicuna.lmsys.org/
22
+
23
+ **License**
24
+ Apache License 2.0
25
+
26
+ **Where to send questions or comments about the model**
27
+ https://github.com/lm-sys/FastChat/issues
28
+
29
+ ## Intended use
30
+ **Primary intended uses**
31
+ The primary use of Vicuna is research on large language models and chatbots.
32
+
33
+ **Primary intended users**
34
+ The primary intended users of the model are researchers and hobbyists in natural language processing, machine learning, and artificial intelligence.
35
+
36
+ ## Training dataset
37
+ 70K conversations collected from ShareGPT.com.
38
+
39
+ ## Evaluation dataset
40
+ A preliminary evaluation of the model quality is conducted by creating a set of 80 diverse questions and utilizing GPT-4 to judge the model outputs. See https://vicuna.lmsys.org/ for more details.