---
library_name: transformers
tags:
- gaudi
- llama3
- llm
- optimum-habana
- text-generation-inference
license: apache-2.0
datasets:
- tatsu-lab/alpaca
language:
- en
pipeline_tag: text-generation
---


# Model Card for gopalakrishnan-d/meta-llama3-8b-alpaca-v1

This model was fine-tuned from meta-llama/Meta-Llama-3-8B on the tatsu-lab/alpaca dataset.

## Model Details

### Model Description

The gopalakrishnan-d/meta-llama3-8b-alpaca-v1 model is a fine-tuned variant of the Llama 3 architecture with 8 billion parameters.
It was fine-tuned to improve performance on diverse language tasks, and training was run on the Intel Gaudi2 accelerator.

- **Hardware Type:** Intel Gaudi2 Accelerator
- **Cloud Provider:** Intel® Tiber™ Developer Cloud
- **Developed by:** gopalakrishnan-d
- **Model type:** Fine-Tuned LLM
- **Language(s) (NLP):** English
- **License:** Apache 2.0
- **Finetuned from model:** meta-llama/Meta-Llama-3-8B

## Uses

 - Customer Service Chatbots
 - Content Generation Tools
 - Educational Tutoring Systems
 - Workflow Automation Systems
 - Personalized Recommendation Engines
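
For any of these uses, prompts will probably work best when they resemble the training data. The sketch below builds a prompt in the instruction template published with the tatsu-lab/alpaca dataset. That this model was trained with exactly this template is an assumption, not something stated in this card, and `build_prompt` is a hypothetical helper name.

```python
from typing import Optional

# Alpaca-style templates. Assumption: the fine-tuning run used this format.
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def build_prompt(instruction: str, context: Optional[str] = None) -> str:
    """Return an Alpaca-style prompt; `context` fills the optional Input field."""
    if context:
        return PROMPT_WITH_INPUT.format(instruction=instruction, input=context)
    return PROMPT_NO_INPUT.format(instruction=instruction)


if __name__ == "__main__":
    # Hypothetical customer-service style request.
    print(build_prompt("Summarize the customer's complaint.", "My order arrived late."))
```

The resulting string can be passed to whichever generation API is used to serve the model.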

#### Training Hyperparameters

- learning_rate: 5e-06 (low rate)
- train_batch_size: 8
- seed: 100
- gradient_accumulation_steps: 1
- optimizer: Adam
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.03
- lora_rank: 16
- lora_alpha: 32
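
To make the interaction between these values concrete, here is a minimal sketch in plain Python of three derived quantities: the LoRA scaling factor (`lora_alpha / lora_rank`), the effective batch size, and the learning rate under a linear schedule with warmup. The total step count, the device count, and the helper names are hypothetical, and rounding the warmup length up with `math.ceil` is an assumption. The actual training script is not shown in this card.

```python
import math

# Values taken from the hyperparameter list above.
LEARNING_RATE = 5e-6
TRAIN_BATCH_SIZE = 8
GRAD_ACCUM_STEPS = 1
WARMUP_RATIO = 0.03
LORA_RANK = 16
LORA_ALPHA = 32


def lora_scaling(alpha: int, rank: int) -> float:
    """Factor by which the low-rank update is scaled in standard LoRA."""
    return alpha / rank


def effective_batch_size(per_device: int, grad_accum: int, num_devices: int = 1) -> int:
    """Samples contributing to one optimizer step (num_devices is an assumption)."""
    return per_device * grad_accum * num_devices


def linear_lr(step: int, total_steps: int,
              base_lr: float = LEARNING_RATE,
              warmup_ratio: float = WARMUP_RATIO) -> float:
    """Linear warmup from 0 to base_lr, then linear decay back to 0."""
    warmup_steps = math.ceil(total_steps * warmup_ratio)  # assumed rounding rule
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    return base_lr * max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    total_steps = 1000  # hypothetical; the real step count is not given in this card
    print("LoRA scaling:", lora_scaling(LORA_ALPHA, LORA_RANK))
    print("Effective batch size:", effective_batch_size(TRAIN_BATCH_SIZE, GRAD_ACCUM_STEPS))
    for step in (0, 15, 30, 500, 1000):
        print(step, linear_lr(step, total_steps))
```

With `lora_alpha` set to twice `lora_rank`, the adapter update is scaled by 2, a common choice that keeps the update magnitude roughly stable if the rank is changed later.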

## Evaluation

To be updated.

### Results