AronXiang
/

RetrospexLLaMA3

Model card Files Files and versions Community

Edit model card

Model Card for Model ID

This model is trained by lora for Retrospex based on AgentInstruct and ShareGPT datasets. The base model is Llama-3-8B-Instruct.

Model Details

Model Description

Developed by: Convai NJU
Shared by [optional]: Convai NJU
Model type: Llama model
Language(s) (NLP): en
License: llama3
Finetuned from model [optional]: Llama-3-8B-Instruct

Model Sources

Repository: https://github.com/Yufei-Xiang/Retrospex.git

Training Details

Training Data

AgentInstruct: https://huggingface.co/datasets/THUDM/AgentInstruct

ShareGPT: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered

Training Hyperparameters

fp16: True
lr: 2e-5
batch size: 8
lora r: 16
lora alpha: 64

Downloads last month: 0

Safetensors

Model size

8.03B params

Tensor type

F32

·

Inference API

Unable to determine this model's library. Check the docs .

Datasets used to train AronXiang/RetrospexLLaMA3