🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Merge pull request #265 from thanhtcptit/master
Fix RoPE inner product equation & add note on the difference in implementation
V
vpj committed
33ab02281c2b928e6b32792909cc79cbdcfe1d6a
Committed by GitHub <[email protected]>
on 1/22/2026, 4:25:59 AM