BEE-spoke-data/beecoder-220M-python

This is BEE-spoke-data/smol_llama-220M-GQA fine-tuned for code generation on:

filtered version of stack-smol-XL
deduped version of 'algebraic stack' from proof-pile-2
cleaned and deduped pypi (last dataset)

This model (and the base model) were both trained using ctx length 2048.

examples

Example script for inference testing: here

It has its limitations at 220M, but seems decent for single-line or docstring generation, and/or being used for speculative decoding for such purposes.

The screenshot is on CPU on a laptop.

Downloads last month: 12

Safetensors

Model size

218M params

Tensor type

BF16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for BEE-spoke-data/beecoder-220M-python

Base model

BEE-spoke-data/smol_llama-220M-GQA

Finetuned

(12)

this model

Quantizations

1 model

Datasets used to train BEE-spoke-data/beecoder-220M-python

Collection including BEE-spoke-data/beecoder-220M-python

finetuned smol 220M

Collection

smol_llama 220M fine-tunes we did • 6 items • Updated Apr 29 • 1