Initiated and Officially Supported by Tensormesh
For years, we have referred to one of the most critical components of modern LLM inference as a “KV cache.” That name made sense once. Today, it is increasingly misleading. What began as a small, ephemeral optimization inside a single inference pass has quietly evolved into something far more important: a first-class data object with…

We have some exciting news to share: NVIDIA Dynamo has officially hit v1.0, and we couldn’t be more thrilled. This is a huge milestone for the LLM inference ecosystem, and for us at LMCache it’s a moment worth celebrating. What Is NVIDIA Dynamo, and Why Does It Matter? If you haven’t been following Dynamo’s journey,…
