Update README.md
Browse files
README.md
CHANGED
@@ -1,22 +1,115 @@
|
|
1 |
---
|
2 |
-
base_model: unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
|
3 |
language:
|
4 |
-
-
|
5 |
license: apache-2.0
|
|
|
6 |
tags:
|
7 |
- text-generation-inference
|
8 |
-
- transformers
|
9 |
- unsloth
|
10 |
- llama
|
11 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
---
|
13 |
|
14 |
-
#
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
|
16 |
-
|
17 |
-
- **License:** apache-2.0
|
18 |
-
- **Finetuned from model :** unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
|
19 |
|
20 |
-
|
21 |
|
22 |
-
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
|
1 |
---
|
|
|
2 |
language:
|
3 |
+
- it
|
4 |
license: apache-2.0
|
5 |
+
library_name: transformers
|
6 |
tags:
|
7 |
- text-generation-inference
|
|
|
8 |
- unsloth
|
9 |
- llama
|
10 |
+
- llama3.1
|
11 |
+
- trl
|
12 |
+
- word-game
|
13 |
+
- rebus
|
14 |
+
- italian
|
15 |
+
- word-puzzle
|
16 |
+
- crossword
|
17 |
+
datasets:
|
18 |
+
- gsarti/eureka-rebus
|
19 |
+
base_model: unsloth/Meta-Llama-3.1-8B-bnb-4bit
|
20 |
+
|
21 |
+
model-index:
|
22 |
+
- name: gsarti/llama-3.1-8b-rebus-solver-fp16
|
23 |
+
results:
|
24 |
+
- task:
|
25 |
+
type: verbalized-rebus-solving
|
26 |
+
name: Verbalized Rebus Solving
|
27 |
+
dataset:
|
28 |
+
type: gsarti/eureka-rebus
|
29 |
+
name: EurekaRebus
|
30 |
+
config: llm_sft
|
31 |
+
split: test
|
32 |
+
revision: 0f24ebc3b66cd2f8968077a5eb058be1d5af2f05
|
33 |
+
metrics:
|
34 |
+
- type: exact_match
|
35 |
+
value: 0.59
|
36 |
+
name: First Pass Exact Match
|
37 |
+
- type: exact_match
|
38 |
+
value: 0.56
|
39 |
+
name: Solution Exact Match
|
40 |
---
|
41 |
|
42 |
+
# LLaMA-3.1 8B Verbalized Rebus Solver - GGUF Q8_0 🇮🇹
|
43 |
+
|
44 |
+
This model is a parameter-efficient fine-tuned version of LLaMA-3.1 8B trained for verbalized rebus solving in Italian, as part of the [release](https://huggingface.co/collections/gsarti/verbalized-rebus-clic-it-2024-66ab8f11cb04e68bdf4fb028) for our paper [Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses](https://arxiv.org/abs/2408.00584). The task of verbalized rebus solving consists of converting an encrypted sequence of letters and crossword definitions into a solution phrase matching the word lengths specified in the solution key. An example is provided below.
|
45 |
+
|
46 |
+
The model was trained in 4-bit precision for 5070 steps on the verbalized subset of the [EurekaRebus](https://huggingface.co/datasets/gsarti/eureka-rebus) using QLora via [Unsloth](https://github.com/unslothai/unsloth) and [TRL](https://github.com/huggingface/trl).This repository contains the GGUF exported checkpoint of the model in `Q8_0` format, and the `Modelfile` for usage with [Ollama](https://ollama.com/) (see below).
|
47 |
+
|
48 |
+
We also provide [adapter checkpoints through training](https://huggingface.co/gsarti/llama-3.1-8b-rebus-solver-adapters) and [FP16 merged](https://huggingface.co/gsarti/gsarti/llama-3.1-8b-rebus-solver-fp16) versions of this model for analysis and local execution.
|
49 |
+
|
50 |
+
## Local usage with Ollama
|
51 |
+
|
52 |
+
A ready-to-use local version of this model is hosted on the [Ollama Hub](https://ollama.com/gsarti/llama3.1-8b-rebus-solver) and can be used as follows:
|
53 |
+
|
54 |
+
```shell
|
55 |
+
ollama run gsarti/llama3.1-8b-rebus-solver "Rebus: [Materiale espulso dai vulcani] R O [Strumento del calzolaio] [Si trovano ai lati del bacino] C I [Si ingrassano con la polenta] E I N [Contiene scorte di cibi] B [Isola in francese]\nChiave risolutiva: 1 ' 5 6 5 3 3 1 14"
|
56 |
+
|
57 |
+
# EXAMPLE GENERATION:
|
58 |
+
#
|
59 |
+
# Procediamo alla risoluzione del rebus passo per passo:
|
60 |
+
# - [Materiale espulso dai vulcani] = lava
|
61 |
+
# - R O = R O
|
62 |
+
# - [Strumento del calzolaio] = lesina
|
63 |
+
# - [Si trovano ai lati del bacino] = anche
|
64 |
+
# - C I = C I
|
65 |
+
# - [Si ingrassano con la polenta] = oche
|
66 |
+
# - E I N = E I N
|
67 |
+
# - [Contiene scorte di cibi] = dispensa
|
68 |
+
# - B = B
|
69 |
+
# - [Isola in francese] = ile
|
70 |
+
#
|
71 |
+
# Prima lettura: lava R O lesina anche C I oche E I N dispensa B ile
|
72 |
+
#
|
73 |
+
# Ora componiamo la soluzione seguendo la chiave risolutiva:
|
74 |
+
# 1 = L
|
75 |
+
# ' = '
|
76 |
+
# 5 = avaro
|
77 |
+
# 6 = lesina
|
78 |
+
# 5 = anche
|
79 |
+
# 3 = ciò
|
80 |
+
# 3 = che
|
81 |
+
# 1 = è
|
82 |
+
# 14 = indispensabile
|
83 |
+
#
|
84 |
+
# Soluzione: L'avaro lesina anche ciò che è indispensabile
|
85 |
+
```
|
86 |
+
|
87 |
+
## Limitations
|
88 |
+
|
89 |
+
**Lexical overfitting**: As remarked in the related publication, the model overfitted the set of definitions/answers for first pass words. As a result, words that were [explicitly witheld](https://huggingface.co/datasets/gsarti/eureka-rebus/blob/main/ood_words.txt) from the training set cause significant performance degradation when used as solutions for verbalized rebuses' definitions. You can compare model performances between [in-domain](https://huggingface.co/datasets/gsarti/eureka-rebus/blob/main/id_test.jsonl) and [out-of-domain](https://huggingface.co/datasets/gsarti/eureka-rebus/blob/main/ood_test.jsonl) test examples to verify this limitation.
|
90 |
+
|
91 |
+
## Model curators
|
92 |
+
|
93 |
+
For problems or updates on this model, please contact [[email protected]](mailto:[email protected]).
|
94 |
+
|
95 |
+
### Citation Information
|
96 |
+
|
97 |
+
If you use this model in your work, please cite our paper as follows:
|
98 |
+
|
99 |
+
```bibtex
|
100 |
+
@article{sarti-etal-2024-rebus,
|
101 |
+
title = "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses",
|
102 |
+
author = "Sarti, Gabriele and Caselli, Tommaso and Nissim, Malvina and Bisazza, Arianna",
|
103 |
+
journal = "ArXiv",
|
104 |
+
month = jul,
|
105 |
+
year = "2024",
|
106 |
+
volume = {abs/2408.00584},
|
107 |
+
url = {https://arxiv.org/abs/2408.00584},
|
108 |
+
}
|
109 |
+
```
|
110 |
|
111 |
+
## Acknowledgements
|
|
|
|
|
112 |
|
113 |
+
We are grateful to the [Associazione Culturale "Biblioteca Enigmistica Italiana - G. Panini"](http://www.enignet.it/home) for making its rebus collection freely accessible on the [Eureka5 platform](http://www.eureka5.it).
|
114 |
|
115 |
+
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|