Uploaded model

Developed by: samsonleegh
License: apache-2.0
Finetuned from model: unsloth/llama-3-8b-bnb-4bit

Finetuned to generate pandas code given a dataframe and a user query.
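As a rough illustration of the task, the sketch below assembles a text prompt from a dataframe preview and a user query. The `build_prompt` helper, the section headers, and the sample data are all hypothetical and are not taken from the training notebook; the actual prompt template may differ, so check the linked Colab notebook before relying on this layout.

```python
def build_prompt(columns, sample_rows, query):
    """Assemble a plain-text prompt (hypothetical layout, not the training template)."""
    header = ", ".join(columns)
    rows = "\n".join(", ".join(str(v) for v in row) for row in sample_rows)
    return (
        "### Dataframe (first rows):\n"
        f"{header}\n{rows}\n\n"
        "### Query:\n"
        f"{query}\n\n"
        "### Pandas code:\n"
    )


# Illustrative data only; the model would be expected to complete the last section
# with pandas code answering the query.
prompt = build_prompt(
    columns=["city", "sales"],
    sample_rows=[["Oslo", 120], ["Lima", 95]],
    query="What is the total sales per city?",
)
print(prompt)
```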
~100 datasets were taken from Kaggle: https://www.kaggle.com/datasets?search=Tabular+data
These datasets were used to generate 390 sets of data queries and pandas code answers via llama3-70b: https://www.kaggle.com/code/samsonleegh/sampling-data-qns-and-pandas-ans-from-dataset
Finetuned llama3-8b-4bit with LoRA 16 adapters on 350 query-answer pairs: https://colab.research.google.com/drive/1UkqjHIq-mP22AfHZCWz4kiU7hcWaXfgi?usp=sharing
Compared the ROUGE scores of the original and finetuned models on 40 query-answer pairs
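The comparison step can be sketched with a small pure-Python ROUGE-L scorer. This is a minimal sketch and not the evaluation code from the notebook: it assumes lowercase whitespace tokenization, which is crude for code, so its numbers will not match those of a standard ROUGE library. The function names are illustrative.

```python
def tokenize(text):
    # Assumption: lowercase whitespace tokenization; real ROUGE libraries tokenize differently.
    return text.lower().split()


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(reference, candidate):
    """ROUGE-L F1 between a reference answer and a generated answer."""
    ref, cand = tokenize(reference), tokenize(candidate)
    if not ref or not cand:
        return 0.0
    lcs = lcs_length(ref, cand)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return 2 * precision * recall / (precision + recall)


def mean_rouge_l(references, candidates):
    """Average ROUGE-L F1 over paired references and model outputs."""
    scores = [rouge_l_f1(r, c) for r, c in zip(references, candidates)]
    return sum(scores) / len(scores) if scores else 0.0
```

To compare two models, call `mean_rouge_l` once with the original model's outputs and once with the finetuned model's outputs, using the same reference answers in both calls.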

ROUGE Score Comparison

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.