is rope_theta and max_pos_emb correct?

#4
by J22 - opened

NTKAlpha results in an effective rope_theta of 11158839.92507748. This seems too large for max_position_embeddings = 4096.

Is there something wrong?

Sign up or log in to comment