Exploiting Local KV Cache Asymmetry for Long-Context LLMs

arxiv.org

2 points by PaulHoule 7 hours ago