"This model also supports the following FIM tokens"

by catarino - opened May 30

Discussion

catarino

May 30

This comment has been hidden

catarino

May 30

•

edited May 30

Can you give more info about how to apply this?

I'm using

<s>[INST]
{User}
[/INST]</s>
{Assistant}

without system prompt, because it wrecks inference.

Would love to know more how to use the system prompt and the FIM tokens. in prompt format, if possible.

bartowski

LM Studio Community org May 31

•

edited May 31

It's not well documented sadly, you also may have to re-download cause they forgot to include it in the originally uploaded tokenizer :)

From their github it looks like it should be:

<s>[SUFFIX]return a + b[PREFIX] def f(

which is definitely.. interesting..

catarino

May 31

Ah ok. Thank you @bartowski !

“They forgot to include it in the originally uploaded tokenizer”. This is also interesting albeit in an odd way 😬

bartowski

LM Studio Community org May 31

Yeah someone noticed that the FIM tokens evaluated to 0 so they uploaded a new tokenizer.model, this repo has the latest

catarino

May 31

•

edited May 31

Great.

regarding I downloaded Codestral-22B-v0.1-Q8_0.gguf from this repo yesterday. Should I download again?

btw, I noticed that I need to set layers to 50 (model has 56 max) to fix the issues on inference.
Having said that, this is the 1st model that I really feel is usable without awkward generations, endless repetitions or odd chars, etc.
Works great with both Pythagora and continue.dev.

Really happy for this.

Really thankful you for your work. :)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment