sayhan
/

gemma-7b-it-GGUF-quantized

Text Generation

Model card Files Files and versions Community

sayhan commited on Feb 23

Commit

302fb77

•

1 Parent(s): c35332e

Create README.md

Files changed (1) hide show

README.md +50 -0

README.md ADDED Viewed

	@@ -0,0 +1,50 @@

+---
+base_model: google/gemma-7b-it
+language:
+- en
+pipeline_tag: text-generation
+license: other
+model_type: gemma
+library_name: transformers
+inference: false
+---
+![image/webp](https://cdn-uploads.huggingface.co/production/uploads/65aa2d4b356bf23b4a4da247/NQAvp6NRHlNILyWWFlrA7.webp)
+## Google Gemma 7B Instruct
+- **Model creator:** [Google](https://huggingface.co/google)
+- **Original model:** [gemma-7b-it](https://huggingface.co/google/gemma-7b-it)
+- [**Terms of use**](https://www.kaggle.com/models/google/gemma/license/consent)
+<!-- description start -->
+## Description
+This repo contains GGUF format model files for [Google's Gemma 7B Instruct](https://huggingface.co/google/gemma-7b-it)
+## Original model
+- **Developed by:** [Google](https://huggingface.co/google)
+### Description
+Gemma is a family of lightweight, state-of-the-art open models from Google,
+built from the same research and technology used to create the Gemini models.
+They are text-to-text, decoder-only large language models, available in English,
+with open weights, pre-trained variants, and instruction-tuned variants. Gemma
+models are well-suited for a variety of text generation tasks, including
+question answering, summarization, and reasoning. Their relatively small size
+makes it possible to deploy them in environments with limited resources such as
+a laptop, desktop or your own cloud infrastructure, democratizing access to
+state of the art AI models and helping foster innovation for everyone.
+## Quantizon types
+| quantization method | bits | size     | description                                            | recommended |
+|---------------------|------|----------|-----------------------------------------------------|-------------|
+| Q3_K_S              | 3    | 20.4 GB  | very small, high quality loss                       | ❌         |
+| Q3_K_L              | 3    | 26.4 GB  | small, substantial quality loss                     | ❌         |
+| Q4_0                | 4    | 26.4 GB  | legacy; small, very high quality loss | ❌         |
+| Q4_K_M              | 4    | 28.4 GB  | medium, balanced quality              | ✅         |
+| Q5_0                | 5    | 33.2 GB  | legacy; medium, balanced quality  | ❌         |
+| Q5_K_S              | 5    | 32.2 GB  | large, low quality loss | ✅         |
+| Q5_K_M              | 5    | 33.2 GB  | large, very low quality loss | ✅         |
+| Q6_K                | 6    | 38.4 GB  | very large, extremely low quality loss              | ❌         |
+| Q8_0                | 8    | 49.6 GB  | very large, extremely low quality loss | ❌         |
+## Usage
+You can use this model with the latest builds of LM Studio and llama.cpp.
+If you're new to the world of large language models, I recommend starting with LM Studio.
+<!-- description end -->