Help Replicating

#15
by edwards98 - opened

Out of curiosity how did you find the vector(s)? Was it a specific layer and position? Was there a specific activation layer type that was good like resid_pre? 3.1 seems harder to break for me :(

I used the technique described in the article, so just checking the outputs that were provided. Nothing specific.

Sign up or log in to comment