Why is the Pixtral activation function "gelu" when the reference code uses "silu"?

#10
by mgoin - opened

The activation function should indeed be "silu". It would be nice if we could correct the implementation here.
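For anyone wondering whether the mismatch matters numerically, here is a minimal sketch comparing the two functions, assuming the standard definitions: SiLU as `x * sigmoid(x)` and GELU in its exact erf form (some implementations use a tanh approximation instead). The helper names `silu` and `gelu` are just for illustration and are not taken from the Pixtral code.

```python
import math


def silu(x: float) -> float:
    """SiLU (also called swish): x * sigmoid(x)."""
    return x / (1.0 + math.exp(-x))


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


if __name__ == "__main__":
    # Print both activations side by side for a few sample inputs.
    for x in (-2.0, -1.0, 0.0, 1.0, 2.0):
        print(f"x={x:+.1f}  silu={silu(x):+.4f}  gelu={gelu(x):+.4f}")
```

The two curves agree at zero but diverge elsewhere, so weights trained with one activation would generally not give the same outputs when run with the other.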

mgoin changed discussion status to closed