EpistemeAI2
/

Fireball-Llama-3.1-8B-Philos-Reflection-KTO-beta-f16-gguf

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Edit model card

Uploaded model

Developed by: EpistemeAI2
License: apache-2.0
Finetuned from model : EpistemeAI2/Fireball-Llama-3.1-8B-Philos-Reflection

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month: 11

GGUF

Model size

8.03B params

Architecture

llama

16-bit

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for EpistemeAI2/Fireball-Llama-3.1-8B-Philos-Reflection-KTO-beta-f16-gguf

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Quantized

unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Finetuned

EpistemeAI2/Fireball-Alpaca-Llama3.1-8B-Philos

Finetuned

EpistemeAI2/Fireball-Alpaca-Llama3.1.08-8B-Philos-C-R1

Finetuned

EpistemeAI2/Fireball-Llama-3.1-8B-Philos-Reflection

Quantized

(4)

this model