SIGN IN SIGN UP
rasbt / LLMs-from-scratch UNCLAIMED

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

0 0 0 Jupyter Notebook

Sliding window KV Cache bug fix (#925)

1. Fix bug because of KV cache and GPT's ptr pointer doesn't get reset when window_size > context_length
2. Fix bug because of KV cache and GPT's ptr pointer doesn't get reset
3. Fix KV Cache import issue for gpt_with_kv_cache_optimized
T
talentJay-ux committed
d7f178d28bdf56732b01544d5718bfed637ac6f9
Parent: a11965f
Committed by GitHub <[email protected]> on 12/16/2025, 12:47:01 AM