SIGN IN SIGN UP

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

0 0 0 Python

flash attention

V
Varuna Jayasiri committed
9262c57f181a52130a64f65bc204fb5b3470f0fd
Parent: 4752644