is rope_theta and max_pos_emb correct?
#4
by
J22
- opened
NTKAlpha results in an effective rope_theta
of 11158839.92507748. This seems too large for max_position_embeddings = 4096
.
Is there something wrong?
NTKAlpha results in an effective rope_theta
of 11158839.92507748. This seems too large for max_position_embeddings = 4096
.
Is there something wrong?