Mixtral gets stupid/lazy and ignores the instructions when it can't print all the output in one response

#12
by yinzhu - opened

I asked Mixtral to write about 100 video scripts that follow a specific structure. With max tokens set to 1024, Mixtral can output about 7 scripts per response. The first batch of 7 scripts followed the structure I specified, but starting with the second batch:

  1. It stopped following the structure.
  2. Its answers got worse (it seemed stupid, or maybe lazy): it failed to understand some ordinary, common knowledge.

How should I resolve this?
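
One workaround that might be worth trying, not confirmed for Mixtral: instead of asking the model to continue, split the job into independent requests and restate the structure in each one. The sketch below only plans the batches and builds the prompts. It assumes the later batches drift because they no longer see the original instructions, which is a guess. `plan_batches`, `build_prompt`, and `STRUCTURE_INSTRUCTIONS` are hypothetical names, and the numbers come from the setup described above.

```python
# Values from the post: about 100 scripts, roughly 7 fit in one 1024-token response.
TOTAL_SCRIPTS = 100
SCRIPTS_PER_BATCH = 7

# Placeholder (assumption): the real structure instructions are not shown in the post.
STRUCTURE_INSTRUCTIONS = "<your structure instructions here>"


def plan_batches(total, per_batch):
    """Return (first, last) script numbers, 1-indexed and inclusive, for each batch."""
    batches = []
    for start in range(1, total + 1, per_batch):
        end = min(start + per_batch - 1, total)
        batches.append((start, end))
    return batches


def build_prompt(first, last, instructions):
    """Build a self-contained prompt that restates the structure for this batch."""
    return (
        f"{instructions}\n\n"
        f"Write scripts {first} through {last}. "
        f"Follow the structure above exactly for every script."
    )


batches = plan_batches(TOTAL_SCRIPTS, SCRIPTS_PER_BATCH)
prompts = [build_prompt(a, b, STRUCTURE_INSTRUCTIONS) for a, b in batches]
print(len(prompts))  # 15
print(batches[-1])   # (99, 100)
```

Each prompt would then be sent as a fresh request, so every batch sees the full instructions and none depends on the previous response still being in context.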
