adpater_config.json의 역할: adapter_config.json은 어댑터 기반 모델 트레이닝에 사용되는 설정 파일입니다. 어댑터는 신경망 모델에서 기존의 레이어 구조를 변경하지 않고 새로운 작업이나 도메인에 대해 모델을 미세 조정하는 기술입니다. 이 파일은 레이어에 추가되는 어댑터의 수, 크기, 활성화 함수, 학습률 등과 같은 중요한 하이퍼파라미터를 정의합니다. 어댑터를 사용하면 전체 모델을 재학습할 필요 없이 효율적으로 미세 조정할 수 있어, 계산 비용과 시간을 절약할 수 있습니다. 이런 방식은 특히 큰 모델에 유용하며, 다양한 작업에 하나의 모델을 적용할 수 있는 유연성을 제공합니다.

테스트 데이터에 대한 설명: 테스트 데이터셋 'nsmc'는 한국어 영화 리뷰 데이터셋으로, 감성 분석 작업에 자주 사용됩니다. 이 데이터셋은 리뷰 텍스트와 해당 리뷰가 긍정적인지 부정적인지를 나타내는 라벨로 구성되어 있습니다. 이 코드에서는 이 데이터셋의 일부를 사용하여 모델을 테스트합니다. 테스트 데이터셋을 사용하는 목적은 훈련된 모델이 실제 세계 데이터에 대해 얼마나 잘 작동하는지를 평가하는 것입니다. 이 과정에서 모델의 일반화 능력과 강인성을 검증할 수 있습니다.

이 코드에서 사용된 테스트 조건은 다음과 같습니다:

데이터셋과 샘플 크기:

사용된 데이터셋은 'nsmc'로, 한국어 영화 리뷰를 담고 있습니다. 이 데이터셋에서 1000개의 테스트 샘플을 사용했습니다. 각 샘플은 텍스트 리뷰와 이 리뷰가 긍정적인지 부정적인지를 나타내는 라벨로 구성됩니다. 입력 데이터의 처리:

테스트 과정에서 각 리뷰 텍스트는 모델에 입력되기 전에 특정 포맷으로 변환됩니다. 이 포맷은 모델이 이해할 수 있도록 텍스트를 구조화하는 방법에 따라 달라질 수 있으며, 일반적으로 토큰화 과정을 포함합니다. 모델의 예측과 평가 방법:

모델은 각 테스트 샘플에 대해 긍정적 또는 부정적인 감성을 예측합니다. 이 예측은 실제 라벨(긍정/부정)과 비교되어 모델의 성능을 평가합니다. 성능 평가 지표:

테스트 과정에서 주로 사용된 성능 평가 지표는 정확도(accuracy)입니다. 정확도는 모델이 올바르게 예측한 샘플의 비율로 계산됩니다. 성능 평가를 위한 계산 방법:

코드는 True Positive (TP), True Negative (TN), False Positive (FP), False Negative (FN)를 계산하여 모델의 정확도를 도출합니다. 이 값들은 모델이 실제 긍정(positive) 라벨을 긍정으로, 실제 부정(negative) 라벨을 부정으로 정확히 예측한 경우와 그렇지 못한 경우를 나타냅니다.

테스트 조건 및 방법론: 모델 테스트는 주어진 입력에 대한 모델의 예측과 실제 라벨을 비교하는 과정입니다. 이 코드에서는 True Positive (TP), True Negative (TN), False Positive (FP), False Negative (FN)의 네 가지 기본 지표를 사용하여 모델의 예측 성능을 평가합니다. TP와 TN은 모델이 올바르게 예측한 경우, FP와 FN은 잘못 예측한 경우를 나타냅니다. 이러한 지표들은 모델의 정확도뿐만 아니라 다른 중요한 성능 지표들(예: 정밀도, 재현율)을 계산하는 데에도 사용됩니다.

결과 해석 정확도 87.8%: 이는 전체 테스트 데이터 중 약 87.8%를 모델이 올바르게 분류했다는 것을 의미합니다. 이는 상당히 높은 정확도이며, 모델이 대부분의 리뷰를 올바르게 감성 분석했음을 나타냅니다.

True Positives와 True Negatives: 높은 TP와 TN은 모델이 긍정적 및 부정적 리뷰를 대체로 잘 분류하고 있음을 나타냅니다.

False Positives와 False Negatives: 비교적 낮은 FP와 높은 FN은 모델이 부정적 리뷰를 긍정으로 잘못 분류할 가능성이 낮지만, 긍정적 리뷰를 부정으로 잘못 분류하는 경향이 있음을 나타냅니다. 이는 모델이 긍정적 표현을 부정적으로 오해하는 경향이 있을 수 있음을 의미합니다.

성능 향상을 위한 고려 사항 데이터 불균형: 데이터셋에 긍정 또는 부정 리뷰의 불균형이 있을 경우, 이는 모델의 학습에 영향을 미칠 수 있습니다. 균형 잡힌 데이터셋을 사용하거나, 가중치 조정과 같은 기법을 통해 이를 보정할 수 있습니다.

Model Card for Model ID

Model Details

Model Description

**Model type: Causal Language Model
**Language(s) (NLP): Korean
**Finetuned from model [optional]: Finetuned from 'KT-AI/midm-bitext-S-7B-inst-v1'

Uses

Direct Use

This model is primarily used for sentiment analysis on Korean text, particularly in classifying movie reviews as positive or negative.

Downstream Use [optional]

The model can be adapted for other types of Korean text classification tasks such as customer feedback analysis, social media sentiment analysis, etc.

Out-of-Scope Use

Bias, Risks, and Limitations

This model, while performing with high accuracy, may exhibit biases present in the training data, potentially leading to skewed results in certain scenarios. Further evaluation and monitoring are recommended to identify and mitigate these biases.

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

The model was fine-tuned on the NSMC (Naver Sentiment Movie Corpus) dataset, consisting of Korean movie reviews with binary sentiment labels.

Training Procedure

Text data were tokenized using a Korean-specific tokenizer. Standard preprocessing steps such as lowercasing and removal of special characters were applied.

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

**Training regime:**Learning Rate: 1e-4 Batch Size: 8 Epochs: 3

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

The model was evaluated on a separate test set extracted from the NSMC dataset, ensuring no overlap with the training data.

Factors

The evaluation focused on the model's ability to accurately classify sentiment in Korean movie reviews.

Metrics

Metrics used include Accuracy, Precision

Results

The model achieved an accuracy of 87.8%

Summary

True Positives와 True Negatives: 높은 TP와 TN은 모델이 긍정적 및 부정적 리뷰를 대체로 잘 분류하고 있음을 나타냅니다.

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

Hardware Type: [More Information Needed]
Hours used: [More Information Needed]
Cloud Provider: [More Information Needed]
Compute Region: [More Information Needed]
Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Training procedure

The following bitsandbytes quantization config was used during training:

quant_method: bitsandbytes
load_in_8bit: False
load_in_4bit: True
llm_int8_threshold: 6.0
llm_int8_skip_modules: None
llm_int8_enable_fp32_cpu_offload: False
llm_int8_has_fp16_weight: False
bnb_4bit_quant_type: nf4
bnb_4bit_use_double_quant: False
bnb_4bit_compute_dtype: bfloat16

Framework versions

PEFT 0.7.0

cheonyumin
/

lora-midm-7b-food-order-understanding

Model Card for Model ID

Model Details

Model Description

Uses

Direct Use

Downstream Use [optional]

Out-of-Scope Use

Bias, Risks, and Limitations

Recommendations

How to Get Started with the Model

Training Details

Training Data

Training Procedure

Preprocessing [optional]

Training Hyperparameters

Speeds, Sizes, Times [optional]

Evaluation

Testing Data, Factors & Metrics

Testing Data

Factors

Metrics

Results

Summary

Model Examination [optional]

Environmental Impact

Technical Specifications [optional]

Model Architecture and Objective

Compute Infrastructure

Hardware

Software

Citation [optional]

Glossary [optional]

More Information [optional]

Model Card Authors [optional]

Model Card Contact

Training procedure

Framework versions

Model tree for cheonyumin/lora-midm-7b-food-order-understanding