← Back to post

Posts quoting @hdevalence.bsky.social

1 post quote this

But I don’t care about the fact that I only get the final layer up to some orthogonal matrix because I only care about getting the model weights up to symmetry. Symmetry at every step of the model architecture. So I want an explanation of where SPECIFICALLY this breaks.

Post not found
Post not found
Post not found
Post not found
(63 comments)replyquoteparent