doberst committed on
Commit
f660e88
1 Parent(s): fbfa29e

Update README.md

  1. README.md +8 -9
README.md CHANGED
@@ -1,15 +1,14 @@
  ---
- license: cc-by-sa-4.0
+ license: apache-2.0
+ inference: false
  ---

- # SLIM-XSUM-TOOL
+ # SLIM-XSUM-PHI-3-GGUF

  <!-- Provide a quick summary of what the model is/does. -->


- **slim-xsum-tool** is a 4_K_M quantized GGUF version of slim-xsum, providing a small, fast inference implementation, optimized for multi-model concurrent deployment.
-
- This model implements an 'extreme summarization' (e.g., 'xsum') function based on the parameter key "xsum" that generates an LLM text output in the form of a python dictionary as follows:
+ **slim-xsum-phi-3-gguf** is a fine-tune of Phi-3 that implements an 'extreme summarization' (e.g., 'xsum') function call based on the parameter key "xsum" that generates an LLM text output in the form of a python dictionary as follows:

  `{'xsum': ['Stock Market declines on worries of interest rates.']} `

@@ -21,7 +20,7 @@ The intent of SLIMs is to forge a middle-ground between traditional encoder-base
  To pull the model via API:

  from huggingface_hub import snapshot_download
- snapshot_download("llmware/slim-xsum-tool", local_dir="/path/on/your/machine/", local_dir_use_symlinks=False)
+ snapshot_download("llmware/slim-xsum-phi-3-gguf", local_dir="/path/on/your/machine/", local_dir_use_symlinks=False)


  Load in your favorite GGUF inference engine, or try with llmware as follows:
@@ -29,14 +28,14 @@ Load in your favorite GGUF inference engine, or try with llmware as follows:
  from llmware.models import ModelCatalog

  # to load the model and make a basic inference
- model = ModelCatalog().load_model("slim-xsum-tool")
+ model = ModelCatalog().load_model("slim-xsum-phi-3-gguf")
  response = model.function_call(text_sample)

  # this one line will download the model and run a series of tests
- ModelCatalog().tool_test_run("slim-xsum-tool", verbose=True)
+ ModelCatalog().tool_test_run("slim-xsum-phi-3-gguf", verbose=True)


- Note: please review [**config.json**](https://huggingface.co/llmware/slim-xsum-tool/blob/main/config.json) in the repository for prompt wrapping information, details on the model, and full test set.
+ Note: please review [**config.json**](https://huggingface.co/llmware/slim-xsum-phi-3-gguf/blob/main/config.json) in the repository for prompt wrapping information, details on the model, and full test set.


  ## Model Card Contact
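
The README text in this commit says the model emits its result as text in the form of a Python dictionary, for example `{'xsum': ['Stock Market declines on worries of interest rates.']}`. When the GGUF file is run in a generic inference engine, that output arrives as a plain string and has to be parsed by the caller. The following is a minimal sketch of that step, assuming the raw output is a Python-literal string keyed by `"xsum"`; the helper name `parse_xsum_output` is hypothetical and is not part of llmware or huggingface_hub, and llmware's own `function_call` may already handle this parsing.

```python
import ast


def parse_xsum_output(raw_output: str) -> list[str]:
    """Return the summaries under the 'xsum' key, or [] if the text is malformed.

    Hypothetical helper for illustration; not an llmware API.
    """
    try:
        # literal_eval only evaluates Python literals, so it is safer than eval()
        parsed = ast.literal_eval(raw_output.strip())
    except (ValueError, SyntaxError, TypeError):
        return []

    if not isinstance(parsed, dict):
        return []

    summaries = parsed.get("xsum", [])
    if not isinstance(summaries, list):
        return []

    # Keep only string entries in case the model emits something unexpected
    return [item for item in summaries if isinstance(item, str)]


# Sample string copied from the README example in the diff (trailing space included)
sample = "{'xsum': ['Stock Market declines on worries of interest rates.']} "
print(parse_xsum_output(sample))
```

Returning an empty list on malformed output, rather than raising, is one possible design choice; a caller that needs to distinguish "no summary" from "unparseable output" may prefer to let the exception propagate.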