Quokka_2.7b / README.md
license: apache-2.0
datasets:
  - the_pile
  - guanaco/guanaco
language:
  - en

Model Card for Cerebras 2.7b Dollyfied.

This is a finetuned version of the Cerebras 2.7b model, trained using the Databricks Labs Dolly framework.

Model Details

Model Description

This is a finetuned version of Cerebras' 2.7 billion parameter model that has been trained to follow instructions.

The finetuning used the Databricks Dolly training tools and ran for 2 epochs.

Uses

This is a simple GPT chatbot that has been finetuned to understand instructions. Its knowledge of facts about the world should be considered suspect at best.

Direct Use

If you find a use for it, please let me know.

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

Any use where any form of accuracy is needed. FOR THE LOVE OF GOD, DO NOT FOLLOW MEDICAL ADVICE FROM THIS, or financial advice.

Bias, Risks, and Limitations

Limitations... Yes, I am sure there are so, so many.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]
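
A minimal sketch of loading the model with the Hugging Face `transformers` library might look like the following. Several things here are assumptions rather than details confirmed by this card: the repository id `Corianas/Quokka_2.7b`, the use of the instruction prompt template from the Databricks Dolly training code, and the sampling settings. The helper names `build_prompt` and `generate` are hypothetical.

```python
# Prompt template from the Databricks Dolly training code.
# ASSUMPTION: this checkpoint expects the same template; the card does not say.
INTRO = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)
PROMPT_FORMAT = "{intro}\n\n### Instruction:\n{instruction}\n\n### Response:\n"

# ASSUMPTION: Hub repository id inferred from the repository name and owner.
MODEL_ID = "Corianas/Quokka_2.7b"


def build_prompt(instruction: str) -> str:
    """Wrap a plain instruction in the Dolly-style prompt template."""
    return PROMPT_FORMAT.format(intro=INTRO, instruction=instruction.strip())


def generate(instruction: str, max_new_tokens: int = 128) -> str:
    """Load the model and return its response to a single instruction."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_prompt(instruction), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,  # sampling settings are illustrative, not tuned
        top_p=0.92,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `generate("Explain what a quokka is in one sentence.")` downloads the weights on first use. If the model was trained with a different prompt layout, only `PROMPT_FORMAT` should need changing.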

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: 8x A100 (accomplished while I was downloading the model I was actually training)
  • Minutes used: 25
  • Cloud Provider: LambdaGPU
  • Compute Region: USA
  • Carbon Emitted: [More Information Needed]
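
As a rough illustration of how the figures in the list above feed into an emissions estimate, the sketch below multiplies GPU count, power draw, and run time to get energy, then scales by a grid carbon intensity. The 400 W per-GPU draw (the nominal TDP of an A100 SXM part; the exact variant is not stated here) and the 0.4 kg CO2eq/kWh intensity are both assumptions, so the result is an order-of-magnitude guide, not a reported value.

```python
def energy_kwh(num_gpus: int, watts_per_gpu: float, minutes: float) -> float:
    """Upper-bound energy use, assuming every GPU draws full power throughout."""
    return num_gpus * (watts_per_gpu / 1000.0) * (minutes / 60.0)


def co2_kg(energy: float, kg_co2_per_kwh: float) -> float:
    """Convert energy in kWh to emissions for a given grid carbon intensity."""
    return energy * kg_co2_per_kwh


# From the list above: 8 GPUs for 25 minutes.
# ASSUMPTION: 400 W per GPU (nominal A100 SXM TDP; the variant is not stated).
# ASSUMPTION: 0.4 kg CO2eq/kWh is a placeholder intensity, not a measurement.
run_energy = energy_kwh(num_gpus=8, watts_per_gpu=400.0, minutes=25.0)
run_co2 = co2_kg(run_energy, kg_co2_per_kwh=0.4)
print(f"{run_energy:.2f} kWh, roughly {run_co2:.2f} kg CO2eq under these assumptions")
```

This ignores CPU, cooling, and datacenter overhead, which a fuller estimate would need to account for.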

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

| Metric | Value |
|---|---|
| Avg. | 29.72 |
| ARC (25-shot) | 31.06 |
| HellaSwag (10-shot) | 47.72 |
| MMLU (5-shot) | 24.8 |
| TruthfulQA (0-shot) | 40.14 |
| Winogrande (5-shot) | 55.49 |
| GSM8K (5-shot) | 0.38 |
| DROP (3-shot) | 8.43 |
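
The reported average appears to be the unweighted mean of the seven benchmark scores. That reading is an inference from the numbers, not something stated here, and a quick check of it is:

```python
# Benchmark scores copied from the results above.
scores = {
    "ARC (25-shot)": 31.06,
    "HellaSwag (10-shot)": 47.72,
    "MMLU (5-shot)": 24.8,
    "TruthfulQA (0-shot)": 40.14,
    "Winogrande (5-shot)": 55.49,
    "GSM8K (5-shot)": 0.38,
    "DROP (3-shot)": 8.43,
}

# Unweighted mean across all seven benchmarks.
average = sum(scores.values()) / len(scores)
print(round(average, 2))  # 29.72
```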