Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,82 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model: meta-llama/Llama-3.2-1B-Instruct-bnb-4bit
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
library_name: transformers
|
6 |
+
license: llama3.2
|
7 |
+
tags:
|
8 |
+
- llama-3
|
9 |
+
- llama
|
10 |
+
- meta
|
11 |
+
- facebook
|
12 |
+
- osllmai
|
13 |
+
- transformers
|
14 |
+
---
|
15 |
+
|
16 |
+
**osllm.ai Models Highlights Program**
|
17 |
+
|
18 |
+
**We believe there's no need to pay a token if you have a GPU on your computer.**
|
19 |
+
|
20 |
+
Highlighting new and noteworthy models from the community. Join the conversation on Discord.
|
21 |
+
|
22 |
+
|
23 |
+
**Model creator**: Meta
|
24 |
+
|
25 |
+
**Original model**: Llama-3.2-1B-Instruct-bnb-4bit
|
26 |
+
|
27 |
+
|
28 |
+
[**README**:](https://huggingface.co/ibm-granite/granite-3.0-8b-instruct/edit/main/README.md)
|
29 |
+
|
30 |
+
<p align="center">
|
31 |
+
<a href="https://osllm.ai">Official Website</a> • <a href="https://docs.osllm.ai/index.html">Documentation</a> • <a href="https://discord.gg/2fftQauwDD">Discord</a>
|
32 |
+
</p>
|
33 |
+
|
34 |
+
|
35 |
+
|
36 |
+
<p align="center">
|
37 |
+
<b>NEW:</b> <a href="https://docs.google.com/forms/d/1CQXJvxLUqLBSXnjqQmRpOyZqD6nrKubLz2WTcIJ37fU/prefill">Subscribe to our mailing list</a> for updates and news!
|
38 |
+
</p>
|
39 |
+
|
40 |
+
|
41 |
+
Email: [email protected]
|
42 |
+
|
43 |
+
|
44 |
+
|
45 |
+
**Acknowledgments**
|
46 |
+
Our sincere gratitude to the Meta and Llama teams for their efforts in developing and releasing these models.
|
47 |
+
|
48 |
+
**Model Overview**
|
49 |
+
The Meta Llama 3.2 collection features multilingual large language models (LLMs), available in 1B and 3B sizes, with capabilities in both text input and output. The instruction-tuned Llama 3.2 models are optimized for multilingual dialogue, excelling in agentic retrieval and summarization tasks. They demonstrate superior performance on standard industry benchmarks compared to many open-source and closed chat models.
|
50 |
+
|
51 |
+
- **Developer**: Meta
|
52 |
+
- **Architecture**: Llama 3.2 is an auto-regressive language model utilizing an optimized transformer structure. Its tuned versions employ supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
|
53 |
+
- **Supported Languages**: Officially supported languages include English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. Llama 3.2 has been trained on a wider array of languages, and developers may further fine-tune the model for additional languages, subject to the Llama 3.2 Community License and Acceptable Use Policy. Responsible and safe deployment practices are required.
|
54 |
+
- **Token Counts**: Token references pertain solely to pretraining data. All versions employ Grouped-Query Attention (GQA) to enhance inference scalability.
|
55 |
+
|
56 |
+
**Release Information**
|
57 |
+
- **Release Date**: September 25, 2024
|
58 |
+
- **Status**: This is a static model based on an offline dataset. Future updates may further enhance model performance and safety.
|
59 |
+
- **License**: Llama 3.2 usage is governed by the Llama 3.2 Community License, [a custom commercial license agreement](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/LICENSE).
|
60 |
+
|
61 |
+
**Feedback and Further Information**
|
62 |
+
For questions or feedback regarding Llama 3.2, please refer to the model README. Additional technical details and guidance on generation parameters, as well as usage recipes, can be found [here](https://github.com/meta-llama/llama-recipes).
|
63 |
+
|
64 |
+
|
65 |
+
**Disclaimers**
|
66 |
+
|
67 |
+
[osllm.ai](https://osllm.ai) is not the creator, originator, or owner of any Model featured in the Community Model Program.
|
68 |
+
Each Community Model is created and provided by third parties. osllm.ai does not endorse, support, represent,
|
69 |
+
or guarantee the completeness, truthfulness, accuracy, or reliability of any Community Model. You understand
|
70 |
+
that Community Models can produce content that might be offensive, harmful, inaccurate, or otherwise
|
71 |
+
inappropriate, or deceptive. Each Community Model is the sole responsibility of the person or entity who
|
72 |
+
originated such Model. osllm.ai may not monitor or control the Community Models and cannot, and does not, take
|
73 |
+
responsibility for any such Model. osllm.ai disclaims all warranties or guarantees about the accuracy,
|
74 |
+
reliability, or benefits of the Community Models. osllm.ai further disclaims any warranty that the Community
|
75 |
+
Model will meet your requirements, be secure, uninterrupted, or available at any time or location, or
|
76 |
+
error-free, virus-free, or that any errors will be corrected, or otherwise. You will be solely responsible for
|
77 |
+
any damage resulting from your use of or access to the Community Models, your downloading of any Community
|
78 |
+
Model, or use of any other Community Model provided by or through [osllm.ai](https://osllm.ai).
|
79 |
+
|
80 |
+
|
81 |
+
|
82 |
+
|