TurboQuant compresses high-dimensional vectors for LLM KV caches and vector search to reduce memory bottlenecks while avoiding accuracy loss.
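To make the idea of compressing a vector concrete, here is a minimal sketch of plain uniform scalar quantization. It is not TurboQuant's algorithm, which the description above does not specify; the function names `quantize` and `dequantize`, the 4-bit width, and the 128-dimensional random vector are all assumptions chosen for illustration.

```python
import random


def quantize(vec, bits=4):
    """Uniform scalar quantization (generic illustration, not TurboQuant itself).

    Maps each float to an integer code in [0, 2**bits - 1] and returns the
    codes plus the offset and step size needed to reconstruct the values.
    """
    lo, hi = min(vec), max(vec)
    levels = (1 << bits) - 1
    # Guard against a constant vector, where hi == lo.
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((x - lo) / scale) for x in vec]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Reconstruct approximate floats from integer codes."""
    return [lo + c * scale for c in codes]


random.seed(0)
# Hypothetical stand-in for one KV-cache or embedding vector.
vec = [random.gauss(0.0, 1.0) for _ in range(128)]

codes, lo, scale = quantize(vec, bits=4)
approx = dequantize(codes, lo, scale)

# Worst-case reconstruction error is bounded by half a quantization step.
max_err = max(abs(a - b) for a, b in zip(vec, approx))

# float32 uses 32 bits per value; 4-bit codes use 4, ignoring the small
# per-vector overhead of storing lo and scale.
ratio = 32 / 4

print(f"max error: {max_err:.4f}, step: {scale:.4f}, compression: {ratio:.0f}x")
```

The sketch shows the basic trade-off: fewer bits per value means less memory and a larger quantization step, so the reconstruction error grows. Methods aimed at KV caches and vector search generally try to shrink that error for a given bit budget.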