10/ Consistent Middle Enhancement in LLMs – proposes an approach to tune an LLM to effectively utilize information from the middle part of the context; it first proposes a training-efficient method to extend LLMs to longer context lengths (e.g., 4K -> 256K).
LLM Context Window Enhancement: Extending to 256K Tokens
By
–