Context extension?

#4
by droussis - opened

Hi, and congratulations on the great model! A European LLM was long overdue!

I noticed that the base model has 4k max_position_embeddings, while the Instruct model has 8k.
Did you follow a specific methodology to increase the context length? I didn't see anything about it in the paper.
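For anyone who wants to reproduce the check, the value can be read straight from the configs on the Hub. A minimal sketch with transformers (the model IDs below are placeholders, not the actual Hub names):

```python
from transformers import AutoConfig

# Placeholder Hub IDs — substitute the real base and Instruct repo names.
base_cfg = AutoConfig.from_pretrained("org/model-base")
instruct_cfg = AutoConfig.from_pretrained("org/model-instruct")

# Print the maximum context length declared in each config.
print("base:", base_cfg.max_position_embeddings)        # e.g. 4096
print("instruct:", instruct_cfg.max_position_embeddings)  # e.g. 8192
```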

Thank you in advance and please correct me if I am wrong.

P.S. I also wanted to ask whether you plan to extend the context length of your models in the future.

UTTER - Unified Transcription and Translation for Extended Reality org

Hi. Thank you.

This is actually a typo; I'll fix it.
That said, we are planning to experiment with increasing the context length for the bigger models we're training.
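For context, one common way to extend the context of llama-family models is RoPE position interpolation, which transformers exposes through the rope_scaling config field. This is only an illustrative sketch (again with a placeholder model ID), not the method the team has committed to:

```python
from transformers import AutoModelForCausalLM

# Load with linear RoPE scaling: positions are interpolated by the given
# factor, roughly doubling the usable context (4k -> ~8k) at some cost in
# quality unless the model is fine-tuned at the longer length.
model = AutoModelForCausalLM.from_pretrained(
    "org/model-base",  # placeholder Hub ID
    rope_scaling={"type": "linear", "factor": 2.0},
)
```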

phmartins changed discussion status to closed
