PerRing
/

llama-3-Korean-Bllossom-8B-fp8

@@ -1,199 +1,91 @@
----
-library_name: transformers
-tags: []
----
-# Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
-## Model Details
-### Model Description
-<!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
-## Uses
-<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
-### Direct Use
-<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
-[More Information Needed]
-### Downstream Use [optional]
-<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
-[More Information Needed]
-### Out-of-Scope Use
-<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
-[More Information Needed]
-## Bias, Risks, and Limitations
-<!-- This section is meant to convey both technical and sociotechnical limitations. -->
-[More Information Needed]
-### Recommendations
-<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
-Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
-## How to Get Started with the Model
-Use the code below to get started with the model.
-[More Information Needed]
-## Training Details
-### Training Data
-<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-[More Information Needed]
-### Training Procedure
-<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-#### Preprocessing [optional]
-[More Information Needed]
-#### Training Hyperparameters
-- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
-#### Speeds, Sizes, Times [optional]
-<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
-[More Information Needed]
-## Evaluation
-<!-- This section describes the evaluation protocols and provides the results. -->
-### Testing Data, Factors & Metrics
-#### Testing Data
-<!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
-#### Factors
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-[More Information Needed]
-#### Metrics
-<!-- These are the evaluation metrics being used, ideally with a description of why. -->
-[More Information Needed]
-### Results
-[More Information Needed]
-#### Summary
-## Model Examination [optional]
-<!-- Relevant interpretability work for the model goes here -->
-[More Information Needed]
-## Environmental Impact
-<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
-Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
-- **Hardware Type:** [More Information Needed]
-- **Hours used:** [More Information Needed]
-- **Cloud Provider:** [More Information Needed]
-- **Compute Region:** [More Information Needed]
-- **Carbon Emitted:** [More Information Needed]
-## Technical Specifications [optional]
-### Model Architecture and Objective
-[More Information Needed]
-### Compute Infrastructure
-[More Information Needed]
-#### Hardware
-[More Information Needed]
-#### Software
-[More Information Needed]
-## Citation [optional]
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
-**BibTeX:**
-[More Information Needed]
-**APA:**
-[More Information Needed]
-## Glossary [optional]
-<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-[More Information Needed]
-## More Information [optional]
-[More Information Needed]
-## Model Card Authors [optional]
-[More Information Needed]
-## Model Card Contact
-[More Information Needed]

+```
+pip install --upgrade accelerate fbgemm-gpu torch
+```
+```python
+from transformers import FbgemmFp8Config, AutoModelForCausalLM, AutoTokenizer
+model_name = "MLP-KTLim/llama-3-Korean-Bllossom-8B"
+quantization_config = FbgemmFp8Config()
+model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", quantization_config=quantization_config)
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+```
+```python
+PROMPT = '''You are a helpful AI assistant. Please answer the user's questions kindly. 당신은 유능한 AI 어시스턴트 입니다. 사용자의 질문에 대해 친절하게 답변해주세요.'''
+instruction = "서울의 유명한 관광 코스를 만들어줄래?"
+messages = [
+    {"role": "system", "content": f"{PROMPT}"},
+    {"role": "user", "content": f"{instruction}"}
+    ]
+input_ids = tokenizer.apply_chat_template(
+    messages,
+    add_generation_prompt=True,
+    return_tensors="pt"
+).to(model.device)
+terminators = [
+    tokenizer.eos_token_id,
+    tokenizer.convert_tokens_to_ids("<|eot_id|>")
+]
+outputs = model.generate(
+    input_ids,
+    max_new_tokens=2048,
+    eos_token_id=terminators,
+    do_sample=True,
+    temperature=0.6,
+    top_p=0.9
+)
+print(tokenizer.decode(outputs[0][input_ids.shape[-1]:]))
+```
+```
+물론입니다! 서울은 다양한 문화와 역사, 그리고 현대적인 매력을 겸비한 도시로, 많은 관광 명소를 자랑합니다. 아래는 서울의 유명한 관광 코스입니다:
+### 코스 1: 역사와 문화의 거리
+1. **경복궁**
+   - 서울의 대표적인 궁궐로, 조선 왕조의 중심지였습니다. 경복궁 내에는 왕궁, 정원, 그리고 다양한 전시가 있습니다.
+2. **북촌 한옥마을**
+   - 전통 한옥이 잘 보존된 마을로, 서울의 전통적인 생활상을 체험할 수 있습니다. 전통 한옥을 방문하여 한옥의 구조와 생활 방식을 배울 수 있습니다.
+3. **인사동**
+   - 전통 문화와 현대 예술이 조화를 이루는 거리입니다. 전통 수공예품 가게, 미술관, 그리고 전통 음식점이 많습니다.
+4. **불국사**
+   - 경복궁 인근에 위치한 불국사에는 불교 관련 전시와 함께 불교 기념품을 구입할 수 있는 곳이 있습니다.
+### 코스 2: 현대와 자연의 조화
+1. **남산 서울타워**
+   - 남산 정상에 위치한 서울타워에서 서울의 전경을 감상할 수 있습니다. 타워 내에는 전망대와 식당, 그리고 다양한 전시가 있습니다.
+2. **남산 힐링로**
+   - 남산 정상까지 오르기 전에 남산 힐링로를 걸으며 서울의 아름다운 경치를 즐길 수 있습니다.
+3. **한강공원**
+   - 서울의 중심에 위치한 한강공원에서는 보트 타기, 자전거 타기, 그리고 산책을 즐길 수 있습니다. 또한, 다양한 공연과 행사가 열립니다.
+4. **동대문 디자인 플라자 (DDP)**
+   - 현대적인 건축물로 유명한 DDP는 전시와 쇼핑을 즐길 수 있는 곳입니다. 다양한 디자이너와 브랜드의 제품을 체험할 수 있습니다.
+### 코스 3: 쇼핑과 엔터테인먼트
+1. **명동**
+   - 서울의 대표적인 쇼핑 거리로, 다양한 브랜드와 전통 가게가 모여 있습니다. 명동에는 다양한 음식점과 카페도 있습니다.
+2. **여의도**
+   - 국제적인 기업과 정부 기관이 모여 있는 여의도는 또한 쇼핑과 레스토랑이 풍부합니다. 여의도 공원도 방문해 보세요.
+3. **홍대**
+   - 젊음의 거리로 유명한 홍대는 다양한 클럽과 카페, 그리고 전통 음식점이 있습니다. 밤에 활기가 넘치는 곳입니다.
+4. **이태원**
+   - 다양한 외국인들이 모이는 이태원은 외국 음식과 커피 가게가 많습니다. 또한, 다양한 소품 가게와 전통 가게도 있습니다.
+이 코스는 서울의 다양한 면모를 체험할 수 있는 길잡이입니다. 각 코스마다 서울의 역사, 문화, 자연, 쇼핑, 그리고 엔터테인먼트를 즐길 수 있습니다. 서울에 방문하시면 꼭 체험해 보세요!<|eot_id|>
+```