vmlx
vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast ttft) + Hybrid SSM Scheduler + Cont Batching + etc!
Core features.
Feature cards focus on what the tool helps users do, not generated setup commands.
JANGTQ heavily compressed MLX models, an L2 disk cache that survives restarts, an L1 paged cache for fast time to first token (TTFT), a hybrid SSM scheduler, continuous batching, and more.
Topics: anthropic-api, kvcache-compression, kvcache-optimization, kvcache-reuse, llm
GitHub stars: 701
Last pushed: 2026-06-19
Official setup path.
Generated install snippets are intentionally not mirrored here because they drift. The page links to source-owned setup docs instead.
Evidence and adoption notes.
These notes help a user decide whether to investigate the official project further.
Source repository last pushed at 2026-06-19T04:08:37Z.
Generated from source metadata; confirm operational details in the official project before adopting it.
Review the upstream license, maintenance activity, and issue history before using it in production.