From-scratch PyTorch implementation of Google's TurboQuant (ICLR 2026) for LLM KV cache compression. 5x compression at 3-bit with 99.5% attention fidelity.
Quick setup
Get started by creating a new file or uploading an existing file. We recommend every repository include a README, LICENSE, and .gitignore.
https://gitmorph.com/tonbistudio/turboquant-pytorch.git CREATE A NEW REPOSITORY ON THE COMMAND LINE
touch README.md git init git checkout -b main git add README.md git commit -m "first commit" git remote add origin https://gitmorph.com/tonbistudio/turboquant-pytorch.git git push -u origin main
PUSH AN EXISTING REPOSITORY
git remote add origin https://gitmorph.com/tonbistudio/turboquant-pytorch.git
git push -u origin main