bprice9
/

Palmyra-Medical-70B-FP8

Text Generation

Model card Files Files and versions Community

bprice9 commited on Aug 6

Commit

e420aae

•

1 Parent(s): 3819ab8

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ The original model performance on biomedical benchmarks is 85.87%.
 - **Model Optimizations:**
   - **Weight quantization:** FP8
   - **Activation quantization:** FP8
-- **Intended Use Cases:** Palmyra-Med-70B is intended for non-commercial and research use in English. Instruction tuned models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks.
 - **Out-of-scope:** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in languages other than English.
 - **License(s):** [writer-open-model-license](https://writer.com/legal/open-model-license/)
@@ -47,7 +47,7 @@ This model can be deployed using the [vLLM](https://docs.vllm.ai/en/latest/) lib
 from vllm import LLM, SamplingParams
 from transformers import AutoTokenizer
-model_id = "bprice9/Palmyra-Med-70B-FP8"
 number_gpus = 2
 sampling_params = SamplingParams(temperature=0.0, top_p=0.9, max_tokens=512, stop_token_ids=[128001, 128009])
@@ -157,7 +157,7 @@ oneshot(
    </td>
    <td style="width: 20%;"><strong>Palmyra-Med-70B (Original FP16)</strong>
    </td>
-   <td style="width: 20%;"><strong>Palmyra-Med-70B-FP8 (This Model)</strong>
    </td>
   </tr>
   <tr>

 - **Model Optimizations:**
   - **Weight quantization:** FP8
   - **Activation quantization:** FP8
+- **Intended Use Cases:** Palmyra-Medical-70B-FP8 is intended for non-commercial and research use in English. Instruction tuned models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks.
 - **Out-of-scope:** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in languages other than English.
 - **License(s):** [writer-open-model-license](https://writer.com/legal/open-model-license/)
 from vllm import LLM, SamplingParams
 from transformers import AutoTokenizer
+model_id = "bprice9/Palmyra-Medical-70B-FP8"
 number_gpus = 2
 sampling_params = SamplingParams(temperature=0.0, top_p=0.9, max_tokens=512, stop_token_ids=[128001, 128009])
    </td>
    <td style="width: 20%;"><strong>Palmyra-Med-70B (Original FP16)</strong>
    </td>
+   <td style="width: 20%;"><strong>Palmyra-Medical-70B-FP8 (This Model)</strong>
    </td>
   </tr>
   <tr>