fernandovmacedo commited on
Commit
eab6a7b
1 Parent(s): 0daee53

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +207 -0
README.md ADDED
@@ -0,0 +1,207 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
3
+ datasets:
4
+ - rhaymison/superset
5
+ language:
6
+ - pt
7
+ library_name: transformers
8
+ license: apache-2.0
9
+ pipeline_tag: text-generation
10
+ tags:
11
+ - portuguese
12
+ - phi
13
+ - text-generation-inference
14
+ - llama-cpp
15
+ - gguf-my-repo
16
+ model-index:
17
+ - name: portuguese-Phi3-Tom-Cat-128k-instruct
18
+ results:
19
+ - task:
20
+ type: text-generation
21
+ name: Text Generation
22
+ dataset:
23
+ name: ENEM Challenge (No Images)
24
+ type: eduagarcia/enem_challenge
25
+ split: train
26
+ args:
27
+ num_few_shot: 3
28
+ metrics:
29
+ - type: acc
30
+ value: 51.15
31
+ name: accuracy
32
+ source:
33
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
34
+ name: Open Portuguese LLM Leaderboard
35
+ - task:
36
+ type: text-generation
37
+ name: Text Generation
38
+ dataset:
39
+ name: BLUEX (No Images)
40
+ type: eduagarcia-temp/BLUEX_without_images
41
+ split: train
42
+ args:
43
+ num_few_shot: 3
44
+ metrics:
45
+ - type: acc
46
+ value: 42.56
47
+ name: accuracy
48
+ source:
49
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
50
+ name: Open Portuguese LLM Leaderboard
51
+ - task:
52
+ type: text-generation
53
+ name: Text Generation
54
+ dataset:
55
+ name: OAB Exams
56
+ type: eduagarcia/oab_exams
57
+ split: train
58
+ args:
59
+ num_few_shot: 3
60
+ metrics:
61
+ - type: acc
62
+ value: 39.86
63
+ name: accuracy
64
+ source:
65
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
66
+ name: Open Portuguese LLM Leaderboard
67
+ - task:
68
+ type: text-generation
69
+ name: Text Generation
70
+ dataset:
71
+ name: Assin2 RTE
72
+ type: assin2
73
+ split: test
74
+ args:
75
+ num_few_shot: 15
76
+ metrics:
77
+ - type: f1_macro
78
+ value: 88.86
79
+ name: f1-macro
80
+ source:
81
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
82
+ name: Open Portuguese LLM Leaderboard
83
+ - task:
84
+ type: text-generation
85
+ name: Text Generation
86
+ dataset:
87
+ name: Assin2 STS
88
+ type: eduagarcia/portuguese_benchmark
89
+ split: test
90
+ args:
91
+ num_few_shot: 15
92
+ metrics:
93
+ - type: pearson
94
+ value: 68
95
+ name: pearson
96
+ source:
97
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
98
+ name: Open Portuguese LLM Leaderboard
99
+ - task:
100
+ type: text-generation
101
+ name: Text Generation
102
+ dataset:
103
+ name: FaQuAD NLI
104
+ type: ruanchaves/faquad-nli
105
+ split: test
106
+ args:
107
+ num_few_shot: 15
108
+ metrics:
109
+ - type: f1_macro
110
+ value: 45.16
111
+ name: f1-macro
112
+ source:
113
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
114
+ name: Open Portuguese LLM Leaderboard
115
+ - task:
116
+ type: text-generation
117
+ name: Text Generation
118
+ dataset:
119
+ name: HateBR Binary
120
+ type: ruanchaves/hatebr
121
+ split: test
122
+ args:
123
+ num_few_shot: 25
124
+ metrics:
125
+ - type: f1_macro
126
+ value: 85.92
127
+ name: f1-macro
128
+ source:
129
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
130
+ name: Open Portuguese LLM Leaderboard
131
+ - task:
132
+ type: text-generation
133
+ name: Text Generation
134
+ dataset:
135
+ name: PT Hate Speech Binary
136
+ type: hate_speech_portuguese
137
+ split: test
138
+ args:
139
+ num_few_shot: 25
140
+ metrics:
141
+ - type: f1_macro
142
+ value: 65.76
143
+ name: f1-macro
144
+ source:
145
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
146
+ name: Open Portuguese LLM Leaderboard
147
+ - task:
148
+ type: text-generation
149
+ name: Text Generation
150
+ dataset:
151
+ name: tweetSentBR
152
+ type: eduagarcia/tweetsentbr_fewshot
153
+ split: test
154
+ args:
155
+ num_few_shot: 25
156
+ metrics:
157
+ - type: f1_macro
158
+ value: 53.32
159
+ name: f1-macro
160
+ source:
161
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct
162
+ name: Open Portuguese LLM Leaderboard
163
+ ---
164
+
165
+ # fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-Q4_K_S-GGUF
166
+ This model was converted to GGUF format from [`rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct`](https://huggingface.co/rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
167
+ Refer to the [original model card](https://huggingface.co/rhaymison/portuguese-Phi3-Tom-Cat-128k-instruct) for more details on the model.
168
+
169
+ ## Use with llama.cpp
170
+ Install llama.cpp through brew (works on Mac and Linux)
171
+
172
+ ```bash
173
+ brew install llama.cpp
174
+
175
+ ```
176
+ Invoke the llama.cpp server or the CLI.
177
+
178
+ ### CLI:
179
+ ```bash
180
+ llama-cli --hf-repo fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-Q4_K_S-GGUF --hf-file portuguese-phi3-tom-cat-128k-instruct-q4_k_s.gguf -p "The meaning to life and the universe is"
181
+ ```
182
+
183
+ ### Server:
184
+ ```bash
185
+ llama-server --hf-repo fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-Q4_K_S-GGUF --hf-file portuguese-phi3-tom-cat-128k-instruct-q4_k_s.gguf -c 2048
186
+ ```
187
+
188
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
189
+
190
+ Step 1: Clone llama.cpp from GitHub.
191
+ ```
192
+ git clone https://github.com/ggerganov/llama.cpp
193
+ ```
194
+
195
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
196
+ ```
197
+ cd llama.cpp && LLAMA_CURL=1 make
198
+ ```
199
+
200
+ Step 3: Run inference through the main binary.
201
+ ```
202
+ ./llama-cli --hf-repo fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-Q4_K_S-GGUF --hf-file portuguese-phi3-tom-cat-128k-instruct-q4_k_s.gguf -p "The meaning to life and the universe is"
203
+ ```
204
+ or
205
+ ```
206
+ ./llama-server --hf-repo fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-Q4_K_S-GGUF --hf-file portuguese-phi3-tom-cat-128k-instruct-q4_k_s.gguf -c 2048
207
+ ```