SIGN IN SIGN UP
rasbt / LLMs-from-scratch UNCLAIMED

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

0 0 0 Jupyter Notebook

Fix bug in masking when kv cache is used. (#697)

* Fix bug in masking when kv cache is used.

* add tests

* dd tests

* upd

* add kv cache test to gh workflow

* explicit mask slicing

* upd

---------

Co-authored-by: rasbt <[email protected]>
M
Martin Ma committed
6522be94beb2640ae2a811249c73c67d0845567c
Parent: 37b26c2
Committed by GitHub <[email protected]> on 6/23/2025, 6:12:56 PM