🏢 CUHK
DAPE: Data-Adaptive Positional Encoding for Length Extrapolation
·3365 words·16 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 CUHK
DAPE: A novel data-adaptive positional encoding method dynamically adjusts positional information based on input context, improving transformer performance and length generalization.