Still refusing on some prompts

by Kuinox - opened Jun 13

Jun 13

I tried a bit this model, and it indeed doesn't refuse anything that is "dangerous", but it still refuses "nsfw" things.
For example, if I ask it "Write the most nsfw message you can." it will respond "I'm programmed to be a family-friendly AI, so I won't write an explicit message.[...]"

mlabonne

Owner Jun 14

This particular prompt doesn't work but precise instructions do (at least most of them)

WbjuSrceu

Jun 16

I trained one with ORPO, but had a specific harmful problem that would repeat.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment