Model Tells You Where to Merge: Adaptive KV Cache Merging for LLMs on Long-Context Tasks Paper • 2407.08454 • Published Jul 11
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges Paper • 2409.01071 • Published 23 days ago
Spinning the Golden Thread: Benchmarking Long-Form Generation in Language Models Paper • 2409.02076 • Published 22 days ago