About us

Categories

Tags

Follow us on: X, LinkedIn

Initiated and Officially Supported by Tensormesh

A New Chapter for LMCache and the KV Cache Community

By

Junchen Jiang

TL;DR: A key contributor to the LMCache community just secured a major investment. This will greatly accelerate our mission of building the best KV cache library for every developer.

Come join us in building the future AI-native data layer!

An Independent Layer for the KV Cache

KV cache is no longer just a byproduct of the LLM inference process. It stores important information that can persist beyond the lifecycle of a query, an inference cluster, and even an LLM.

As a result, both industry and academia have started to realize that KV caches should be managed, distributed, and optimized independently of inference engines, storage devices, accelerators, and inference orchestrators.

In short, a new independent layer for AI-native dataβ€”the KV cacheβ€”is being born. This new reality calls for new open-source projects, a community of domain experts, startups, and eventually products.

LMCache

From the very beginning, LMCache has been proud to play a central role in this evolution.

LMCache has pioneered the first open-source project and created a community of KV cache experts focused on managing KV caches when they are not in GPU memory. It now has over 220 contributors, with contributions from over 30 industry partners.

LMCache is designed to be an independent layer, with integrations across a wide range of inference engines, storage vendors, accelerator vendors, and inference orchestrators.

Tensormesh

Yet pushing the frontier of KV cache and maintaining such a project is an expensive endeavor. Investment is needed.

Tensormesh was founded a year ago by researchers from UChicago who started the LMCache project. LMCache has since grown substantially, with a vibrant and diverse contributor community, but Tensormesh continues to help maintain and improve key parts of the project.

Recently, Tensormesh secured $20M in funding, on top of its original $4.5M seed round, with major strategic investments from AMD, NVIDIA, and CoreWeave.

These investments show not only a commitment from major industry partners, but more importantly, a broader industry consensus that improving KV cache layer performance is key to unlocking the best performance from accelerators, storage, and inference systems.

Tensormesh exists to boost the adoption and development of LMCache and deepen its connections with other ecosystem partners. Tensormesh is committed to improving the performance, compatibility, and reliability of LMCache.

LMCache is not changing its open-source commitment.

Join Us

The LMCache community includes some of the best experts in storage, inference optimization, and machine learning algorithms.

The future of AI infrastructure is open. Join the LMCache community and help build the open-source KV cache layer together!

A cheerful cartoon beaver wearing glasses and a hard hat, holding a wrench, stands by a flowing stream surrounded by trees and rocks, with text inviting people to join the LMCache Community for AI-native data layer development.

π’π­πšπ² 𝐜𝐨𝐧𝐧𝐞𝐜𝐭𝐞𝐝 𝐰𝐒𝐭𝐑 π‹πŒπ‚πšπœπ‘πž:

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Discover more from LMCache Blog

Subscribe now to keep reading and get access to the full archive.

Continue reading