arxiv:2310.13017
Position Interpolation Improves ALiBi Extrapolation
Published on Oct 18, 2023
Authors:
Abstract
Linear position interpolation helps pre-trained models that use rotary position embeddings (RoPE) extrapolate to longer sequence lengths. We propose applying linear position interpolation to extend the extrapolation range of models that use Attention with Linear Biases (ALiBi). We find that position interpolation significantly improves extrapolation capability on upstream language modelling and on downstream summarization and retrieval tasks.
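As a rough illustration of the idea in the abstract, the sketch below builds the standard ALiBi bias (a per-head slope multiplied by the query-key distance) and then applies linear interpolation when the inference length exceeds the training length. The abstract does not spell out the exact formulation, so the scaling factor `train_len / seq_len`, the function names `alibi_slopes` and `alibi_bias`, and the `train_len` parameter are assumptions made for illustration. Because the bias is linear in distance, rescaling positions is equivalent to rescaling the slopes by the same factor.

```python
import numpy as np

def alibi_slopes(num_heads):
    # Standard ALiBi slopes for a power-of-two head count:
    # a geometric sequence 2^(-8/n), 2^(-16/n), ..., 2^(-8).
    return np.array([2.0 ** (-8.0 * (h + 1) / num_heads) for h in range(num_heads)])

def alibi_bias(seq_len, num_heads, train_len=None):
    # Distance between query position i and key position j, clipped at 0
    # for future keys (the causal mask is assumed to be applied separately).
    pos = np.arange(seq_len)
    distance = np.maximum(pos[:, None] - pos[None, :], 0)

    # Assumed interpolation rule: shrink distances by train_len / seq_len
    # only when the input is longer than the training length.
    scale = 1.0
    if train_len is not None and seq_len > train_len:
        scale = train_len / seq_len

    slopes = alibi_slopes(num_heads)
    # Result has shape (num_heads, seq_len, seq_len) and is added to the
    # attention logits before the softmax.
    return -slopes[:, None, None] * scale * distance[None, :, :]
```

With this scaling, the largest penalty at the extended length stays below the slope times the training length, so the biases remain within the range the model saw during training.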